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gatccgacca caccagcccg. gcaccccgcg gggatactcg ccccgtccgc cgtccggaga 60 

tcgctgcccc gcgccaccgc ctggccggca cgctgctgcc gctacgccac cagggccgcc 120 

gcgcctgcct tcagctccac tgcgtccatt gccggacccg gcttggccac gccagccgga 180 

ggccccgcca ccgagcacct gggccgaccc cgccctggcg ccgatacgca gtcggacgcg 240 

acccggcgag cgtggttggc gacgcatggt gcggctggtc acctttggcc ttgtcggcct 300 

gggccggtcg ggcatgcagc gccaggaggc ccaattcgaa gcaacgatac gaaccgtcct 360 

gcatggcaac cacaaggtcg ccgtgctggg caaaggaggt gtgggaaaga cgtcggttgc 420 

ggcgtgcgtc ggatcgatcc ttgccgaact gcgccagcag gaccgtatcg tcgggatcga 4 80 

cgccgacacc gccttcggca ggctgagcag ccgaatcgat cctcgagcag ctggttcgtt 54 0 

ctgggagctg accaccgaca cgaatctgcg gtccttcacc gatatcaccg cgcgcctggg 600 

ccgaaattcc gcgggactgt acgtcctggc aggccagccg gcatccggtc cgcgccgggt 660 

gctcgatccg gccatctacc gcgaagccgc cctaaggttg gatcaccatt tcgcaatctc 720 

ggtgatcgac tgcggttcct ccatggaggc ggcggtcacc caggaagtat tgcgcgatgt 780 

ggatgctctg atcgtggtgt cctcgccctg ggcggatggt gcctccgctg ccgccaacac 840 

catcgaatgg ctgtcggatt atggcctgac aggtttgttg cgacgcagca tcgtggtgct 900 

caacgattcg gacggacacg ccgacaagcg caccaagtca ttgctggccc aggaattcat 960 

cgaccacggg cagcctgtgg tcgaggtgcc cttcgatccc catttgcggc ccgggggggt 1020 

catcgatatg agccacgaaa tggccccgac gacgcggctg aaaatcctgc aggtcgccgc 1080 

gacggtgacg gcgtacttcg cgtcgcgacc cgccgacgca cacggcagcc cgccccggtg 1140 

acctggctgg ctgacccggt cggcaacagc aggatcgccc gagcgcaggc ctgcaaaacg 12 0 0 

tcaatctcgg cgcccatcgt cgaatcctgg cgggcgcaac gcggcgcgca atgtggacag 12 6 0 

cgcgagaaat cttgtcgatg ttctcgcgct gtccacatcc agggcatctc accgccactg 132 0 

ttccgcagac ccctcgaacc agcggtccag gcggcggttg cgtcatgccg attgggcaga 13 80 

cacccggtgg tcgcgcaccg ggtaaccgtt gcgctcggcc agggatcgca gctggcccaa 144 0 

cgcgaatgcc cgcgcccggc ctgattcggg aattacgacc cctgcccaca gcccttccgc 1500 




acccgcggac tcgacggcgt cgcgtgcaca 
ggtcttggcc tcgtcgtcgg gagtcgtcgt 
gagcgggacc tcatacaggg cggttactgt 
ggttgtagcc tctgccgcga aagcgtatcg 
ctctgcctgc ctagcggtgc tgcggctccg 
accgccgagt accagggcat agatcctgtt 
gtatccgacc ccttcgggca gatcttccag 
tgtgaagtga actgtagcgg cagttcggtt 
tcgcgtcgct agatccaaaa tgtagcgaag 
atggcgggcg agcggaaagt ctgcccaccg 
acgcagatgt ctaaagcagg gtcgactgtc 
ggcacatcag acgtgattga gccccgtcgc 
gtcggcaccc agatcgacga ttctcgaatc 
gatggacggt ggcggatcgt cggcaacatc 
ggcagctcgg tgaccgtcag cgataagacc 
aaggcgttga cgttcgaagt cgtcaggccg 
caaccatcag cggacctgtc ggacgacccg 
cccggcgtgg ttcgcgcagg ggcggccgcg 
caacgcagct tggcggccga cgggatcatc 
ggccgtagtt ggccccggga acggacccgg 
gctggaacca tcgcgcgaat ccgtcggggc 
gcgtcccccg gactccggcc tgccgacggc 
gccgccgtag acggctgcag tctggctatc 
ttcaccgaac gtgccgcgcc gatccttgct 
caagcaaccc gcatcagccg gattaccccg 
cgccaccacg acgaattaat gaggctggga 
cgcttatatg ccgcacggcg gcgcgcgaac 
ggcgtcgcag aagaaatgat cgtcggcgcc 
accgaagcga tcgaagcact gatccgtcag 
aagtacaggc acgccgtaac gctcaagttc 
cgtttgacgg tgcgttccat cgtgccgcga 
cgtgcaccaa cccccctccg agcgctgcta 
atgatggcgc accggaacgg cgagcgtaag 
tttgcaagac gacgagtaac tttatttggg 
ccgacccagg cgacattttc aacaccggtt 
tgggactcgc cattccgggc accaactgga 
agaacatcgc gcaacaactt cgcgcacagg 
acatgatctc gaatcaggcc aaatacgtct 
agaagatgat tgacggtgtc tacaaggttt 
gccacttgtg gtcgtgggag ctcgcaatcc 
gcggcgcatt gctctatcta acgattatga 
ttctcggcag gctgatcgag atgttgacga 
tgcccagcct gcccgacatc atcgacggcc 
tccccggcct gcccgacatc ccgggcctac 
gcccgttgtt ccccgacctc ccgtcgttcc 
ccatccccgg gttccccgca ctgcccgggt 
tgccgggtct gggcgacctg ctgcccggcg 
ctgagctggc cgctttgcct gacttcttgg 
ttggcaatct gctcagcttt gccagtttgc 
gtcagctgca acagctcgtg gcggccggcg 
gccaacaagc gcaactgatc tcgtcgcagg 
tcgtgagcga caagaaggaa gacgaggaag 
acgctggcac cgcggccagc caacgggggc 
agtcgccagc aggtctgtgc catagcgagt 
agaggagggt tcaacccatg accggatttc 
tggcgggcat gcacaacgag atcgtgggtg 
ggattagcgg acgagttcag .cttacccatg 
tgcaagagtt tgagaccacc cgtagcagca 
gactggccaa taatctgctc gcagccgccg 
ccggtgttat cgacaagatt ttcggttgat 
gggcaccgcc gacaacgtgg tcggcgtcga 
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cagccaccgg cgcgggcaag cccggcacag 1560 
ccaacgatcg ggatcttgcg tgcaaacgcc 1620 
catgtctacg ttcctccaga aagcgttgca 1680 
cattaaccat agcgatgcaa cagtttcctc 1740 
gttcggcgag ctccgagctc tagtgcgcgc 1800 
aatcagctgt gtatctggcc tcgccggcgc 1860 
gaaaagtgtt ctgacatgcg acagttcagg 1920 
tggctaggaa actatttcca tagcgggccg 1980 
tcatagcagt agaagggtgc aacggttagg 2040 
tcccggctag tacccgcgaa taagggatca 2100 
ggaccggcgc cgctggtcgc gtgcagcggc 2160 
ggtgtcgcga tcattggcca ctcgtgccga 2220 
tctcagacac atctgcgagc ggtatccgat 22 80 
ccgagaggta tgttcgtcgg cggacgacgc 2340 
ctaatccgat tcggcgatcc ccctggaggc 24 00 
tcggattccg ctgcacagca cggccgcgta 24 60 
gcgcacaacg ctgcgccggt cgcaccggac 2520 
gctgcgcgcc gtcgtgaact tgacatcagc 2580 
aacgcgggcg cgctcatcgc gttcgagaaa 264 0 
gcaaaactcg aagaagtgct gcagtggccc 2700 
gagcccaccg agcccgcaac aaaccccgac 2760 
ccggcgtcct tgatcgcgca ggctgtcacc 2820 
gcagcgttgc cggcgaccga ggaccccgag 2880 
gatttgcgcc agctcgaggc gattgccgtc 2940 
gaattgatca aggcgttggg cgcggtacgt 3 000 
gcaaccgccc ctggtgccac actggcgcag 3 060 
ctttccaccc tggagactgc ccaagcggcc 3120 
gaagccgagg aagagttgcc agccgaggcc 3180 
atcaattgag gtcggctccg agcgtcccac 3240 
aacggtccgg ggaacgcgcg cgttctccgg 3 3 00 
acttgaaaac gccagcgtca ccaaaaaatt 33 60 
agctcaatgt gcagtgcaaa ggtgcagata 3420 
gaaacacata aatggcatcg ggtagcggtc 3480 
gccagttact cttgcttgga gagggaatcc 3540 
cgtcgctgtt caaacaaatc agcgacaaaa 3 6 00 
tcggccaagc ggcggaagct tacctaaacc 3 6 60 
tgatgggcga tctcgacaaa ttaaccggca 3720 
ccgatacgcg cgacgtcctg cgggccatga 37 80 
gtaagggcct cgaaaagatt ccgctgctcg 3 84 0 
ctatgtccgg catcgcgatg gccgttgtcg 3 900 
cgctgatgaa tgcgaccaac ctgaggggaa 3 960 
ccttgccaaa gttccccggc ctgcccgggt 4 02 0 
tctggccgcc gaagttgccc gacattccga 4080 
ccgacttcaa atggccgccc acccccggca 4140 
cagggttccc cgggttcccg gagttccccg 4200 
tgcccagcat tcccaacttg ttccccggct 4260 
taggcgattt gggcaagtta cccacctgga 4320 
gcggcttcgc cggcctgccc agcttgggtt 4380 
ccaccgtggg tcaggtgacc gccaccatgg 4440 
gtggccccag ccaactggcc agcatgggca 4500 
cccagcaagg aggccagcag cacgccaccc 4560 
gcgtggccga ggcggagcgt gcacccatcg 4 62 0 
aggaggggac cgtcctttga tcggacaccg 4680 
cgaagccata gcgagtagaa agttaaacgt 4740 
tcggtgtcgt gccttcgttc ctgaaggtgc 4800 
atatcaaaag ggcgaccgat acggtcgccg 4 860 
gttcgttcac gtcgaaattc aatgacacgc 4920 
cgggcacggg tttgcaggga gtcaccagcg 4 980 
gcgcctacct caaggccgac gatggcctag 5040 
catgacgggt ccgtccgctg caggccgcgc 5100 
ggtaaccatc gacggcatgt tggtgatcgc 5160 




cgatcggtta cacctggttg atttccctgt 
agaggatctg cgagacatcg tctgggaaca 
gctcgacctc cacggggagc cccaaccgac 
gccagatcgg accttggagg gtcgctggtg 
cttcgtcgtg tgccgcaggg gcgaccgcca 
ggtgctgcag ttggtggcgc cgcaggtcgg 
gcccgccgaa cccgccaacg tcgaacccct 
cacaaccgcg tcccaattga cgcaatacgg 
cgagatcgtg ggtaacccga ccggctgggt 
cggcaccacg acgcagaccg acgccgccgc 
ggtgtcgctt ccccgccgtg ttggaggcga 
gcagaacttg gagcgtgcgc tggacggctt 
agatcacacc tcagatcacg cacaagcctc 
acgacttcag aaagggacgc catggtggac 
ctcgacgccc tcgatttctc cgccgcccac 
gacgactatg cgccggtgca gaccgatgac 
ctcaccgaac gcgacgagga gccggagctg 
tcggtgtcgg tctcaaccct gatggacggc 
gcgaccagca tgtccgaagc gcagctggcc 
cgccaaaagg cgcgggcgtc gcagtacacg 
gacgaagacg cagaaggcag cgccctgctg 
ccgacgccgg aagaggctgc cgcagccgaa 
gactacacct cccggtacaa ggccgatgac 
cgtcagcatg ttgccgatgt cggaggcgcg 
ctacgacgaa tccgcttgcg acgcatggat 
ggtgacgctg tttcgcgcct ggtattcgcg 
ccagatctcg atgagcacgt taaacgccag 
cacctacccg gtcacctcgc cgctagcgat 
gcaaggcaat tacgccgacg ccatggaggc 
gcacctggtg gcgtggatga aggcggttgt 
gatcgaccag gtcaagagtg ctgggaaatg 
tgtggcgcac ggggttgccg cggcaaacct 
caccgaggcc aacgactcge ccgccggtga 
ggcgatggca cggcgcagcc agggcaacga 
acagaccact caccccgagc ccaaagtggc 
gaagacgacc accgccgaac agatcgcatc 
cgtgaccgac aactccggcc gggagcggct 
ccaaattggg ctcacccggg ttaaaaatca 
ggcccgggtc cgcgccgcca agggtatgaa 
caccggaccg cccggtaccg gcaagaccac 
cggcttaggc gtcattgccg aacccaaact 
cgagtacgag gggcaatcgg cggtcaagac 
ggtgcttttc atcgacgagg cttatgcgct 
gttcggtcaa gaggcgctgg acacgctgct 
ggtggtgatc atcgccgggt acagctccga 
tctgcggtcg cggttcgcca ctcgcatcga 
cgagatcgcc aacgtcattg ccgctgctga 
gaactttctt caggccgcca agcagttgga 
ggacgtcgcc. ggcaacggtc ggtatgcgcg 
ggacatgcgt ctagcccagg tcctcgatat 
gatcaacggc tcagatatgg cggaggctat 
agaatgaact atggggcttc gcctcaccac 
gctgcgccgg ctcgaacacg ccatcgtgcg 
gcagttctac agccgctcga tcgctcttgg 
tgccgcgctg ctggcgtact tcaaaccaca 
cgaccgcgcg accaaccagc tttacgtgct 
. cctgacttcg gcgcggctgg tgctgggcaa 
cgaactgagc aagctgccga tgggccagac 
gcctgtttcg gcgggcagca cctcgatctg 
ctccacttcc ccggtagtgc agaccgcggt 
gatcgatccg ctccagtcac acgaagcggt 
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cacgcttggg attcggccga atatcccgca 522 0 
ggtgcagcgt gacctcacag cgcaaggggt 52 8 0 
ggtcgcggag atggtcgaaa ccctgggcag 5340 
gcggcgcgac attggcggcg tcatggtgcg 5400 
tgtgatcgcg gcgcgcgacg gcgacatgct 5460 
cttggcgggc atggtgacag cggtgctggg 5520 
gacgggtgtg gcaaccgagc tagccgaatg 5580 
tatcgcaccg gcctcggccc gcgtctatgc 564 0 
ggagatcgtt gccagccaac gccaccccgg 5700 
tggcgtcctg gactccaagc tcggtaggct 5760 
cctgtacgga agcttcctgc ccggcactca 5820 
gctagagctg ctccctgcgg gcgcttggct 5880 
ctcccgaggc tgacccctca catctccgct 5940 
ccgccgggca acgacgacga ccacggtgat 6000 
accaacgagg cgtcgccgct ggacgcctta 6060 
gccgaaggcg acctggacgc cctccatgcg 612 0 
gagttgttca cggtgaccaa ccctcaaggg 6180 
agaatccagc acgtcgagct gacggacaag 6240 
gacgagatct tcgttattgc cgatctggcc 63 00 
ttcatggtgg agaacatcgg tgaactgacc 6360 
cgggaattcg tggggatgac cctgaatctg 6420 
gccgaagtgt tcgccacccg ctacgatgtc 6480 
tgatcgcttg gccagtctgt tcgaaagcgc 654 0 
gtcgctagat ctgttcaccg agatcaccaa 6600 
cggccggatc cggtgtgggg acaccgaccg 6660 
ccgcaatttc ggacagttgt cgggatcggt 6720 
gattgccatc ggggggctgt acggcgatat 6780 
caccatgggc tttgccgcat gcgaggcagc 6840 
cttagaggcc gccccggtcg cgggttccga 6900 
ctacggcgcg gccgaacgct ggaccgacgt 6960 
gccggacaag tttttggccg gcgcggccgg 7020 
ggccttgttc accgaagccg aacgccgact 7080 
ggcgtgtgcg cgcgccatcg cctggtatct 7140 
aagcgccgcg gtggcgctgc tggaatggtt 7200 
tgcggcgctg aaggatccct cctaccggct 7260 
ccgcgccgat ccctgggatc cgggcagtgt 7320 
gctcgccgag gcccaagccg aactcgaccg 7380 
gattgaacgc taccgcgcgg cgacgctgat 7440 
ggtcgcccag cccagcaagc acatgatctt 7500 
gatcgcgcgg gtggtggcca atatcctggc 7560 
cgtcgagacg tcgcgcaagg acttcgtcgc 7 62 0 
cgctaagacg atcgatcagg cgctgggcgg 7680 
ggtgcaggaa agagacggcc gcaccgatcc 774 0 
ggcgcggatg gagaacgacc gggaccggct 7800 
catagatcgg ctgctggaaa ccaacgaggg 7860 
gttcgacacc tattcccccg aggaactcct 7 92 0 
tgattcggcg ttgaccgcag aggcggccga 7980 
gcagcgcatg ttgcgcggcc ggcgcgccct 8040 
ccagctggtg gaggccagcg agcaatgccg 8100 
cgacaccctc gacgaagacc ggcttcgcga 8160 
cgccgcggtg cacgcacacc tcaacatgag 8220 
caaggttcag gttagcggct ggcgttttct 8280 
ccgggacacc cggatgtttg acgacccgct 8340 
catcgtcgtc gcggtcctga ttctggcggg 8400 
aggcaaactc ggcggcacca gcctgttcac 8460 
gctgtccgga cagttgcatc cggtctacaa 8520 
tccggccaac ccggccaccg tgaagtcctc 8580 
cgttggaatc cccggcgccc cctacgccac 8640 
gaccctatgc gacaccgtcg cccgagccga 8700 
catcgcgatg ccgttggaga tcgatgcttc 8760 
gctggtgtcc taccagggcg aaacctggat 8820 




cgtcacaact aagggacgcc acgccataga 
ggggataccg gtgacggcca ggccaacccc 
tgatatgggg ccctggcagc tgccgccgat 
cctacctgat gatctagtga tcggatcggt 
atactatgtg gtgctgcccg acggcatcgc 
gcgcgccacc caggcgcacg ggctggtcgc 
cagaatcgcc gaacgggtat acccctcacc 
ccggccgcag gatcccgcgc tgtgctggtc 
gcagtcaacg gtgctgtccg gccggcatct 
gatcaagcag atccacggga cggcgaccgt 
gcaatccccc gatcctcgat acaccgaatc 
ttatggggtg cctaacgcgg agacagccaa 
gccctgggag atcgttcgtc tcctggtcga 
gctcgagcac gacacgctgc ccgctgaccc 
cggagccccc tgatgacgac caagaagttc 
accccgggcg agatcagcct cacgccgccc 
ggcgtccaaa agatccttcc ctacgtgatg 
atggtggccg gcggcaccag gcagctgtcg 
atcgtgatga tggtcggcgg tctggccggt 
gaaatcaacg ccgaccgcaa ggagtacctg 
acgtcctcgg ccacctctca ggtggcgttc 
ctgttgtcga tcgtcggcac ccaacggcag 
gcggccaccc gaatcggtat cggtgaccag 
gtcggcgggg agttggccgc cgccagcgca 
catatgtggg tggtcaagtt tctacgaacc 
ctgcaactcc gtacctttcc gactatcgcg 
ctgatgacgg cgatgatctg tcacctagcc 
cgggtgctca ccgaggaacc cgacgacccc 
gtacagcacc agaccgaaac cgatgcggcc 
gaaggtctgt cggacctggc cgcgcgcggg 
ccctacgtag tcgtcgtcga cctgaccggc 
gccggtgtca cggtgatcac gttgggcaac 
cacgaggatg ggacggctga tgaccggctc 
gtcaccgatc ggatgtcgcc gcagcaagcc 
tccatcacgg gcaccatcct cgacaagacg 
tggcaccagc tggtcggtgc gcaaagtgtc 
tacaccg[aca ccgaccgtga ccggctaaag 
aacgtcatgt acctggacat caaagagggc 
ctcatcggga ccacggggtc tgggaagtcc 
gtggcaatga ctcatccaga tcaggtgaat 
accttcctgg gaatggaaaa gcttccgcac 
gaagccgagc tcgtcagccg gatgggcgag 
tcgatcctcc gacaggccgg gatgaaagtc 
gaatacgaga agtaccgcga acgcggtgcc 
gtcgtcgacg agttcgccga gctgttgcag 
cggatctgcc gcgtcgggcg -gtcgctgagg 
cagaccggcg gtgttcgcat cgacaaactg 
cgcaccacca gctctcatga atccaaggcg 
accaacaagg agagcggtgt cgggtttctc 
agcaccttct acatcagtgg gccatacatg 
gaagccggag ggcccggtca acagaccact 
gcggcaccgg ttctcgagga ggcgccgaca 
gcagcgatga ggaggagcgg cgccaacggc 
tgaggaggag cggcgcgcat gactgctgaa 
ctggaccagc tcggcactgc tgaatcgcgt 
aatccggtcc cgctcaacga gctcatcgcc 
ctggggatca tggatgaacc gcgccgccat 
ggggccggcg gcaacatcgg tattgggggc 
cagacgatgg tgatgtcggc cgccgccaca 
atcgacctag gtggcggcgg gctgatctat 
gccaatcggt ccgagcccga caaggtcaac 
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tctgaccgac cgcgccctca cctcgtcgat 8880 
gatctcggag ggcatgttca acgcgctgcc 8940 
accggcggcg ggcgcgccca attcgcttgg 9000 
cttccagatc cacaccgaca agggcccgca 9060 
gcaggtcaac gcgacaaccg ctgcggcgct 9120 
gccaccggca atggtgccca gtctggtcgt 9180 
gctacccgat gaaccgctca agatcgtgtc 9240 
atggcaacgc agcgccggcg accagtcgcc 93 00 
gccgatatcg ccctcagcga tgaacatggg 9360 
ttacctcgac ggcggaaaat tcgtggcact 942 0 
gatgtactac atcgatccac agggcgtgcg 94 80 
gtcgctgggc ctgagttcac cccaaaacgc 954 0 
cggtccggtg ctgtcgaaag atgccgcact 96 00 
tagcccccga aaagttcccg ccggagcctc 9660 
actcccacca ttacccgtgg cccccggttg 9720 
gatgacctgg gcatcgacat cccaccgtcg 9780 
ggtggcgcca tgctcggcat gatcgccatc 9840 
ccgtacatgt tgatgatgcc gctgatgatg 9900 
agcaccggtg gtggcggcaa gaaggtgccc 9960 
cggtatttgg caggactacg cacccgagtg 10020 
ttctcctacc acgcaccgca tcccgaggat 10080 
tggtcccggc cggccaacgc cgacttctat 10140 
ccggcggtgg atcgattatt gaagccggcc 10200 
gcacctcagc cgttcctgga gccggtcagt 10260 
catggattga tccatgactg cccgaaactg 10320 
atcggcgggg acttggcggg ggcagccggc 103 80 
gtgttccacc caccggacct gctgcagatc 10440 
gactggtcct ggctcaaatg gcttccgcac 10500 
gggtccaccc ggctgatctt cacgcgccag 10560 
ccacacgcac ccgattcgct tcccggcggc 10620 
ggcaaggctg gattcccgcc cgacggtagg 106 8 0 
catcgcggct cggcctaccg catcagggtg 1074 0 
cctaaccaat cgtttcgcca ggtgacatcg 10800 
agccgtatcg cgcgaaagtt ggccggatgg 10860 
tcgcgggtcc agaagaaggt ggccaccgac 10 920 
gaggagataa caccttcccg ctggaggatg 10980 
atcccgtttg gtcatgaact aaagaccggc 1104 0 
gcggaattcg gcgccggacc gcacggcatg 11100 
gaattcctgc gcaccctgat cctgtcgctg 11160 
ctcctgctca ccgacttcaa aggtggttca 11220 
actgccgctg tcgtcaccaa catggccgag 112 80 
gtgttgaccg gagaactcga tcggcgccag 11340 
ggcgcggccg gagccctgtc cggcgtggcc 114 00 
gacctacccc cgctgccaac gcttttcgtc 11460 
agtcacccgg acttcatcgg gctgttcgac 11520 
gtccatctgc -tgctggctac- ccagtcgctg-_11580 _ 
gagccaaacc tgacatatcg aatcgcattg 11640 
gtaatcggca caccggaggc gcagtacatc 11700 
cgggtcggca tggaagaccc ggtcaagttc 11760 
ccgccggcgg caggcgtcga aaccaatggt 11820 
agacaagccg cgcgcattca caggttcacc 11880 
ccgtgacccg cgccggcgac gatgcaaagc 11940 
ccgcgccggc gacgatgcaa agcgcagcga 12000 
ccggaagtac ggacgctgcg cgaggttgtg 12060 
gcgtacaaga tgtggctgcc gccgttgacc 12120 
cgtgatcggc gacaacccct gcgatttgcc 12180 
ctacaggatg tgtggggcgt agacgtttcc 12240 
gcacctcaaa ccgggaagtc gacgctactg 12300 
cactcaccgc gcaacgttca gttctattgc 12360 
ctcgaaaacc ttccacacgt cggtggggta 12420 
cgggtggtcg cagagatgca agccgtcatg 124 80 



cggcaacggg aaaccacctt caaggaacac cgagtgggct cgatcgggat gtaccggcag 1254 0 
ctgcgtgacg atccaagtca acccgttgcg tccgatccat acggcgacgt ctttctgatc 12600 
atcgacggat ggcccggttt tgtcggcgag ttccccgacc ttgaggggca ggttcaagat 12 660 
ctggccgccc aggggctggc gttcggcgtc cacgtcatca tctccacgcc acgctggaca 12720 
gagctgaagt cgcgtgttcg cgactacctc ggcaccaaga tcgagttccg gcttggtgac 12780 
gtcaatgaaa cccagatcga ccggattacc cgcgagatcc cggcgaatcg tccgggtcgg 12840 
gcagtgtcga tggaaaagca ccatctgatg atcggcgtgc ccaggttcga cggcgtgcac 12900 
agcgccgata acctggtgga ggcgatcacc gcgggggtga cgcagatcgc ttcccagcac 12960 
accgaacagg cacctccggt gcgggtcctg ccggagcgta tccacctgca cgaactcgac 13 02 0 
ccgaacccgc cgggaccaga gtccgactac cgcactcgct gggagattcc gatcggcttg 13080 
cgcgagacgg acctgacgcc ggctcactgc cacatgcaca cgaacccgca cctactgatc 1314 0 
ttcggtgcgg ccaaatcggg caagacgacc attgcccacg cgatcgcgcg cgccatttgt 13200 
gcccgaaaca gtccccagca ggtgcggttc atgctcgcgg actaccgctc gggcctgctg 13260 
gacgcggtgc cggacaccca tctgctgggc gccggcgcga . tcaaccgcaa cagcgcgtcg 13320 
ctagacgagg ccgttcaagc actggcggtc aacctgaaga agcggttgcc gccgaccgac 13380 
ctgacgacgg cgcagctacg ctcgcgttcg tggtggagcg gatttgacgt cgtgcttctg 13440 
gtcgacgatt ggcacatgat cgtgggtgcc gccgggggga tgccgccgat ggcaccgctg 13500 
gccccgttat tgccggcggc ggcagatatc gggttgcaca tcattgtcac ctgtcagatg 13560 
agccaggctt acaaggcaac catggacaag ttcgtcggcg ccgcattcgg gtcgggcgct 13 62 0 
ccgacaatgt tcctttcggg cgagaagcag gaattcccat ccagtgagtt caaggtcaag 13680 
cggcgccccc ctggccaggc atttctcgtc tcgccagacg gcaaagaggt catccaggcc 1374 0 
ccctacatcg agcctccaga agaagtgttc gcagcacccc caagcgccgg ttaagattat 13800 
ttcattgccg gtgtagcagg acccgagctc agcccggtaa tcgagttcgg gcaatgctga 13860 
ccatcgggtt tgtttccggc tataaccgaa cggtttgtgt acgggataca aatacaggga 13 92 0 
gggaagaagt aggcaaatgg aaaaaatgtc acatgatccg atcgctgccg acattggcac 13980 
gcaagtgagc gacaacgctc tgcacggcgt gacggccggc tcgacggcgc tgacgtcggt 1404 0 
gaccgggctg gttcccgcgg gggccgatga ggtctccgcc caagcggcga cggcgttcac 14100 
atcggagggc atccaattgc tggcttccaa tgcatcggcc caagaccagc tccaccgtgc 14160 
gggcgaagcg gtccaggacg tcgcccgcac ctattcgcaa atcgacgacg gcgccgccgg 1422 0 
cgtcttcgcc gaataggccc ccaacacatc ggagggagtg atcaccatgc tgtggcacgc 142 8 0 
aatgccaccg gagctaaata ccgcacggct gatggccggc gcgggtccgg ctccaatgct 1434 0 
tgcggcggcc gcgggatggc agacgctttc ggcggctctg gacgctcagg ccgtcgagtt 14400 
gaccgcgcgc ctgaactctc tgggagaagc ctggactgga ggtggcagcg acaaggcgct 14460 
tgcggctgca acgccgatgg tggtctggct acaaaccgcg tcaacacagg ccaagacccg 1452 0 
tgcgatgcag gcgacggcgc aagccgcggc atacacccag gccatggcca cgacgccgtc 14580 
gctgccggag atcgccgcca accacatcac ccaggccgtc cttacggcca ccaacttctt 1464 0 
cggtatcaac acgatcccga tcgcgttgac cgagatggat tatttcatcc gtatgtggaa 14700 
ccaggcagcc ctggcaatgg aggtctacca ggccgagacc gcggttaaca cgcttttcga 14760 
gaagctcgag ccgatggcgt cgatccttga tcccggcgcg agccagagca cgacgaaccc 14820 
gatcttcgga atgccctccc ctggcagctc aacaccggtt ggccagttgc cgccggcggc 14880 
tacccagacc ctcggccaac tgggtgagat gagcggcccg atgcagcagc tgacccagcc 14 94 0 
gctgcagcag gtgacgtcgt tgttcagcca ggtgggcggc accggcggcg gcaacccagc 15000 
cgacgaggaa gccgcgcaga tgggcctgct cggcaccagt ccgctgtcga accatccgct 15060 
ggctggtgga tcaggcccca gcgcgggcgc gggcctgctg cgcgcggagt cgctacctgg 1512 0 
cgcaggtggg tcgttgaccc gcacgccgct gatgtctcag ctgatcgaaa agccggttgc 15180 
cccctcggtg- atgccggcgg-ctgctgccgg- atcgtcggGg-acgggtggcg-ccgctccggt— 1524 0 
gggtgcggga gcgatgggcc agggtgcgca atccggcggc tccaccaggc cgggtctggt 15300 
cgcgccggca ccgctcgcgc aggagcgtga agaagacgac gaggacgact gggacgaaga 15360 
ggacgactgg tgagctcccg taatgacaac agacttcccg gccacccggg ccggaagact 1542 0 
tgccaacatt ttggcgagga aggtaaagag agaaagtagt ccagcatggc agagatgaag 154 8 0 
accgatgccg ctaccctcgc gcaggaggca ggtaatttcg agcggatctc cggcgacctg 1554 0 
aaaacccaga tcgaccaggt ggagtcgacg gcaggttcgt tgcagggcca gtggcgcggc 15600 
gcggcgggga cggccgccca ggccgcggtg gtgcgcttcc aagaagcagc caataagcag 15660 
aagcaggaac tcgacgagat ctcgacgaat attcgtcagg ccggcgtcca atactcgagg 1572 0 
gccgacgagg agcagcagca ggcgctgtcc tcgcaaatgg gcttctgacc cgctaatacg 15780 
aaaagaaacg-gagcaaaaac .atgacagagc agcagtggaa tttcgcgggt atcgaggccg 15840 
cggcaagcgc aatccaggga aatgtcacgt ccattcattc cctccttgac gaggggaagc 15900 
agtccctgac caagctcgca gcggcctggg gcggtagcgg ttcggaggcg taccagggtg 15 960 
tccagcaaaa atgggacgcc acggctaccg agctgaacaa cgcgctgcag aacctggcgc 16020 
ggacgatcag cgaagccggt caggcaatgg cttcgaccga aggcaacgtc actgggatgt 16080 
tcgcataggg caacgccgag ttcgcgtaga atagcgaaac - acgggatcgg gcgagttcga 16140 



ccttccgtcg 
catggcggcc 
tatggcagcg 
aaacctaccg 
gttcgtgtcg 
gccgatcgcc 
ccccatgccc 
catcgccgga 
acctgcaccG 
gccaaccgga 
acatcaaccc 
cccgcccgct 
ccaacactcc 
cgggaaggta 
cgcgcagctc 
tctggctccg 
caactccggt 
ggcgcaacct 
gccggatctc 
gaaggtgaag 
ctggcgacat 
gtacgagctg 
cgtcgtcggt 
gttggctcag 
cctcgccgat 
agagctgtcg 
agtgctgccg 
tttcatcgcc 
cttcttcgac 
aagtgtctca 
cggttaccaa 
acccaatgtc 
ggtcgtggtc 
gctcgaccct 
cgagagggct 
gcaaccgctg 
gatttggtac 
ctttccgagg 
caaggcgtgt 
gatgacgccg 
cgctaccgac 
gagttcgacc 
cccgtcatcg 
ttggcgattg 
taccagagcg 
-gccgcagcgc- 
gccggcgccg 
cgtcatgagt 
gccttcggct 
attgtgacga 
ccggtacccg 
gctaccagcg 
gtccggctca 
ggcaccctga 
cacagcctgg 
gccgagcgct 
acggccaaac 
acggtagccc 
ccggtcgtaa 
atgctgctgt 
tcggctgatt 



gtctcgccct 
gactacgaca 
cagccgttct 
aagcccaacg 
gccccgccgc 
gcaggagagc 
atcgccggac 
cccgaaccgg 
accccaaccg 
gcgccgcagc 
cggcgcaccg 
ccgtccagac 
cgacgtgcgc 
gcaactggtc 
gcccccggaa 
cccacccgcc 
cggcgtgccg 
gattcaatta 
gacgcgacac 
ccccagaaac 
tgggtgcatg 
gacctgcacg 
ctcaaaggtg 
gtgcgggccg 
cgggtagggc 
cactacaacg 
gcaccggaat 
gatcctgcgt 
ccgctgaccc 
atcgacggcg 
gatttggcga 
gcagttaaag 
atgccgtggg 
atctacaagc 
ggacgtcgtt 
cgcggcctgc 
tgccagcggc 
tgttggaaga 
gggcgttcgc 
gggtggtcga 
cgttggtcga 
gcacggcatt 
ggatggcgat 
gcatcctggg 
gccacctggc 
-tggccgtgcc 
ctacggccgt 
tggcgtcgtt 
atggatacca 
atgcggccaa 
gcgaaaccgt 
aagaaacccc 
ccgagcgcag 
ttctggctgc 
tggtcgcggg 
ggtgtgcgtg 
tcatcatctg 
tggttgcgct 
aacgaactct 
ggatcaccgg 
ggcggttcct 



ttctcgtgtt 
agctcttccg 
tcgaccccag 
gccagactcc 
cgccaccccc 
cgccctcgcc 
ccgaaccggc 
ccccacccaa 
aatcccagtt 
aaccggaatc 
caccagcacc 
cgtctgcgtc 
gccggggtca 
catccatcca 
cggagccctc 
ccgcgccgac 
agcgacgcgt 
cggccgcaac 
agaaatcctt 
cgaaggccac 
cgttgacgcg 
ctcgagtccg 
gggctggcaa 
accggatcct 
gacaatcggg 
acatccgcgc 
acagctcggc 
cgaggtttta 
gcggcgtgct 
cacaacaggc 
gccgcgcatg 
acctggtgcg 
acaggcacat 
gcaaggtcct 
gagcgcacct 
caccacccgg 
ggtgccgatg 
cacgccggct 
tcgtcccgga 
cgggtcactg 
ggatgtcatc 
gaatcgcttt 
gcgggcgtgg 
gatcgctgtg 
cgagtgccta 
-gttgccgcgc- 
gctgtttttg 
tgccgtgatc 
ggactgggtc 
gctgaccgtc 
ggacaacgag 
gacctggcag 
caaactggcc 
cggtgccatc 
tttgatcacg 
ggcgttgctg 
gtacccgcac 
cgtggtggtc 
ggaattgatc 
ggtgtacgac 
gacagaacat 



tatacgtttg 
gccgcacgaa 
tgcttcgttt 
gcccccgacg 
acccccacct 
ggaaccggcc 
cccacccaaa 
accacccaca 
ggcgcccccc 
accggcgccc 
gccctgggca 
cccggccgaa 
ccgctatcgc 
ggcgcggctg 
gccagcgccg 
agaacctccc 
ccaccccgat 
cactggcggt 
aaggccggcg 
gaagccgccc 
aatcaacctg 
ccgcaatccc 
aaccacgctg 
ggctctagac 
cgcgaccatc 
acacactagc 
gcagcgcgcg 
caacctcgtc 
gtccacggtg 
gtcggtcgcg 
cgtggtcatc 
gcatttcgaa 
tgcggccgga 
cgaattggcc 
gctgttgctg 
gtgacgatcc 
gaaacttata 
gatgtactcg 
tcgccgccgc 
ctgactctgg 
gacgcgatcg 
gtgggggcgg 
tgggaaactg 
ctggtaggca 
ctggtcacga 
-ggggtcaaet- 
accttgatga 
accgctatcg 
cccgcggggg 
gcggtcgcgc 
gagttgctcg 
gccatcatcg 
aagcaacttc 
gcggtcgtgg 
accgtctgcg 
gcggcgacgg 
tatgcctggc 
gggtcgatgg 
gacggcgcca 
acggtccgca 
cgaggacacg 



agcgcactct 
ggtatggaag 
ccgccggcgc 
tccgacgacc 
ccgcctccgc 
gcatctaaac 
ccacccacac 
cctccgatgc 
agaccaccga 
cacgtaccct 
aagatgccaa 
ccaccgaccc 
acagacaccg 
cgggcagagg 
ttgggccaac 
cccagcccct 
ttagccgccc 
cgtcgccgca 
gccaaggggc 
aaagtggtgt 
ggcctgtcac 
cgcgggtcgt 
acagcagcgt 
gcggatccag 
gctgatgtgc 
gtcaatgcgg 
ctcagcgacg 
ttggctgatt 
tccggtgtcg 
ttggactggt 
aatcacatca 
cagcaagttc 
accgagattt 
gcagcgctat 
ctggtcctac 
tgaccggcag 
ttgacgacac 
gcggcttcga 
tgaagctcga 
tgtcagtcag 
ccgtgcttga 
cgatcccgct 
ggcgtagctt 
gcttcgtcgc 
cgtatctgct 
-Ggfctgggggc- 
cgcggggcgg 
cggtcatcgc 
ggatcgcatt 
ggatcgcgct 
atcccgtcgc 
cgtcggtgcc 
tgatcggata 
tgcgcgggca 
gatttcgctc 
tcgcgattcc 
tgttgttgag 
ctcacgtccg 
tgatcgctgc 
atatccggtt 
gcgcaggttt 



gagaggttgt 
ctccggacga 
ccgcatcggc 
tgtcggagcg 
caactccgat 
cacccacacc 
cccccatgcc 
ccatcgccgg 
caccacaaac 
cgcacgggcc 
tcggcgaacc 
ggcctgcccc 
aacgaaacgt 
aagcatccgg 
cgagatcgta 
cgccgcagcg 
aacatgccgc 
agcgtgcagc 
cgaaggtgaa 
cgcagcgcgg 
ccgacgagaa 
atcagatcgc 
tggggtcgac 
gcgccggaaa 
ttgcagaaaa 
tcaatctgga 
ccgactggca 
gtggggccgg 
tggtcgtggc 
tgcgcaacaa 
tgccgggaga 
aacccggccg 
cactcgactt 
ccgacgattt 
cgccgcgggg 
acggatgacc 
cgtcgcggtg 
ctttaccgcg 
ccagtcactc 
tcgcaccgag 
cgagtcacct 
tttgaccgcg 
gtggtggccg 
gaacaggttc 
gatcgcaacc 
-gccacaagtt- 
ccctcggaag 
ggccgccgct 
cgggctgttc 
gccgccgatt 
gaccccggag 
cgcgtccgcg 
cgtcacgtcg 
cttctttgta 
gcggctttac 
gacgggtctg 
cgtctacctc 
gcgcgtttca 
catcattccc 
ctgagccgga 
gcataccttc 



16200 

16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
-18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680. 
19740 
19800 



ggcgcccgac- aaattgctgc gattgagcgt gtggcgcgtc cggtaaaatt tgctcgatgg 19860 
ggaacacgta taggagatcc ggcaatggct gaaccgttgg ccgtcgatcc caccggcttg 19920 
agcgcagcgg ccgcgaaatt ggccggcctc gtttttccgc agcctccggc gccgatcgcg 19980 
gtcagcggaa cggattcggt ggtagcagca atcaacgaga ccatgccaag catcgaatcg 20040 
ctggtcagtg acgggctgcc cggcgtgaaa gccgccctga ctcgaacagc atccaacatg 20100 
aacgcggcgg cggacgtcta tgcgaagacc gatcagtcac tgggaaccag tttgagccag 20160 
tatgcattcg gctcgtcggg cgaaggcctg gctggcgtcg cctcggtcgg tggtcagcca 20220 
agtcaggcta cccagctgct gagcacaccc gtgtcacagg tcacgaccca gctcggcgag 20280 
acggccgctg agctggcacc ccgtgttgtt gcgacggtgc cgcaactcgt tcagctggct 20340 
ccgcacgccg ttcagatgtc gcaaaacgca tcccccatcg ctcagacgat cagtcaaacc 20400 
gcccaacagg ccgcccagag cgcgcagggc ggcagcggcc caatgcccgc acagcttgcc 20460 
agcgctgaaa aaccggccac cgagcaagcg gagccggtcc acgaagtgac aaacgacgat 2 052 0 
cagggcgacc agggcgacgt gcagccggcc gaggtcgttg ccgcggcacg tgacgaaggc 2 0580 
gccggcgcat caccgggcca gcagcccggc gggggcgttc ccgcgcaagc catggatacc 2 0640 
ggagccggtg cccgcccagc ggcgagtccg ctggcggccc ccgtcgatcc gtcgactccg 20700 
gcaccctcaa caaccacaac gttgtagacc gggcctgcca gcggctccgt ctcgcacgca 20760 
gcgcctgttg ctgtcctggc ctcgtcagca tgcggcggcc agggcccggt cgagcaaccc 20820 
ggtgacgtat tgccagtaca gccagtccgc gacggccaca cgctggacgg ccgcgtcagt 20880 
cgcagtgtgc gcttggtgca gggcaatctc ctgtgagtgg gcagcgtagg cccggaacgc 2 0 940 
ccgcagatga gcggcctcgc ggccggtagc ggtgctggtc atgggcttca tcagctcgaa 21000 
ccacagcatg tgccgctcat cgcccggtgg attgacatcc accggcgccg gcggcaacaa 21060 
gtcgagcaaa cgctgatcgg tagtgtcggc cagctgagcc gccgccgagg ggtcgacgac 21120 
ctccagccgc gaccggcccg tcattttgcc gctctccgga atgtcatctg gctccagcac 21180 
aatcttggcc acaccgggat ccgaactggc caactgctcc gcggtaccga tcaccgcccg 21240 
cagcgtcatg tcgtggaaag ccgcccaggc ttgcacggcc aaaaccgggt aggtggcaca 213 00 
gcgtgcaatt tcgtcaaccg ggattgcgtg atccgcgctg gccaagtaca ccttattcgg 21360 
caattccatc ccgtcgggta tgtaggccag cccatagctg ttggccacga cgatggaacc 21420 
gtcggtggtc accgcggtga tccagaagaa cccgtagtcg cccgcgttgt tgtcggacgc 214 80 
gttgagcgcc gccgcgatgc gtcgcgccaa ccgcagcgca tcaccgcggc cacgctggcg 21540 
ggcgctggca gctgcagtgg cggcgtcgcg tgccgcccga gccgccgaca ccgggatcat 21600 
cgacaccggc gtaccgtcat ctgcagactc gctgcgatcg ggtttgtcga tgtgatcggt 21660 
cgacggcggg cgggcaggag gtgccgtccg cgccgaggcc gcccgcgtgc tcggtgccgc 21720 
cgccttgtcc gaggtagcca ccggcgcccg cccagtggca gcatgcgacc ccgcgcccga 21780 
ggccgcggcc gtacccacgc tcgaacgcgc gcccgctccc acggcggtac cgctcggcgc 21840 
ggcggccgcc gcccgtgcgc ccgggacacc ggacgccgca gccggcgtca ccgacgcggc 21900 
ggattcgtcc gcatgggcag gccccgactg cgtccccccJg cccgcatgct ggcccggcac 21960 
accaggttgc tccgccaacg ccgcgggttt gacgtgcggc gccggctcgc cccctggggt 22020 
gcccggtgtt gctggaccag acggaccggg agtggccggt gtaaccggct ggggcccagg 22080 
cgatggcgcc ggtgccggag ccggctgcgg gtgtggagcg ggagctgggg taacgggcgt 22140 
ggccggggtt gccggtgtgg ccggggcgac cgggggggtg accggcgtga tcggggttgg 22200 
ctcgcctggt gtgcccggtt tgaccggggt caccggggtg "accggcttgc ccggggtcac 22260 
cggcgtgacg ggagtgccgg gcgttggtgt gatcggagtt accggcgctc ccgggatggg 22320 
tgtgattggg gttcccgggg tgatcggggt tcccggggtg atcggggttc ccggtgtgcc 22380 
cggtgtgccc ggggatggca cgaccagggt aggcacgtct gggggtggcg gcgacttctg 22440 
ctgaagcaaa tcctcgagtg cgttcttcgg aggtttccaa ttcttggatt ccagcacccg 22500 

ctcagcggtc-tcggcgacca-gactgacatt-ggccccatgc-gfcegccgtga~eeaafcgaatrt~2 2 56G 

gatggcggta tggcgctcat cagcatccag gctagggtca ttctccagga tatcgatctc 22620 
ccgttgagcg ccatccacat tattgccgat atcggattta gcttgctcaa tcaacccggc 22680 
aatatgcctg tgccaggtaa tcaccgtggc gagataatcc tgcagcgtca tcaattgatt 22740 
gatgtttgca cccagggcgc cgttggcagc attggcggcg ccgccggacc ataggccgcc 22800 
ttcgaagacg tggcctttct . gctggcggca ggtgtccaat acatcggtga ccctttgcaa 22860 
aacctggcta tattcctggg cccggtcata gaaagtgtct tcatcggctt ccacccagcc 22920 
gcccggatcc agcatctgtc tggcatagct gcccgtcggc ctggtaatac tcatccccta 22980 
ctgccctccc caaaccgcca gatcgcctcg cggatcaccg tccggttggc ctccggcatt 23040 
tcacgccggc tcggccgctg. gatccacccc gcgccggtat tcgcagtaac ccgttgaatc 23100 
cgcgcgcatg atgcaccgct tgggcgatca gccgggtggt cacctcgctt gcgctggccg 23160 
cgctgtcgca cggggcgctc ggtggtaacg gacgtcataa ttaaccagcg taaccgaacc 23220 
taagaccagc tagctgcggc aatattggcg accaggacta tggcgccctc cgaacccggc 23280 
cgatccatgt caaaacattg acaatgcgta ctcacgccgt gtcgggcgcg ctgaatgacc 23340 
gcattgcggc gctcattcgg tgcgtagtcg ctaccaccgc aacaatgggc ttaggccatt . 23400 
ccttcgttca tcgcgcggga -catggccgat-aacgcagcgg-tcagcfegctc-gGccgccgcg-23460 




tcgttatacg cggacgccgc ggcctgcgca ttgtgcagcg cctcgttgac ccgctgagcc 23520 
accgcctcgg cacccagctt cttcagcaaa ccatcttcga tgcgcaggcc ggtgagccac 23580 
tggtgcccat tgatcgtcac ttcgacggtc tcggcttcgt cggtggcgcg gaaggatccg 23640 
ttgttcatct gattgagcgt cccgtctagg gccgactgaa accgcgccgc cagcgtcaac 23700 
gcccgggcga catgcgggtc caattcgtcc atgctcactt cgactcctta ctgtcctggc 23760 
gccgacggtt accaatgacg gcctcggtcc atgcccgatc ctcggtgtag agcgcctcgt 23820 
cttcctgctg agaacccttg gacttggcgc ccccttgtcc ctgatgcgcg gcacccatcg 23880 
gcattcccat gccaccgccg cccagcgcgg cgccgccgcc ggcccttccc tggcctaagc 2 3 940 
cggcaatgtc accagcgcca gcgggccgca ccgattcggc gcccccgatc gcggatccca 2 4 000 
acggcgccga cggcaccccg ccgcctccac cgccaccgag cgatgccgct ttgaccgcca 24060 
cgtcgcccga cagcgctgcg gcttcccgcc cagccgacgt cagctgcgcc gccgtgtcag 24120 
ccgggaggcc accacccggc gatccggtag gcggaaccat cggtgcggct ggcatcccgg 24180 
taccgggagt cacaccggag ccgtcagacg gcggcatcag gaagccaggg atcaatccct 24240 
gctcttgcgg aggcgggggc gggtcgatct tgatggcggg gggaggcttc ggcgggttta 24300 
ccggttccag ggctgccttg ttgttgtatt cggtcagcac cttctccgac ctctgctgat 24360 
actccgcgta caccgggaga atttggtcgc gggccgaagg gttttccgcg taaagccgtt 24420 
cgagcccgac tatgtcttca taagtcggat gttcccgcct agcccacacg tgcagctgcg 244 80 
cgacatattg agcctgcttg gccatcgcag cgctcaattt ggccatgtgg agtatccatt 24540 
gccgttgttg atcgagcgaa gcctcgcaag cggtagccgc atcgccttcc cagttgtcaa 24 600 
acccccggaa ccgcttgacg tcgccttgca gcgtcaggtt gaaagtgttc cacccatccg 24660 
caaagtgcgc gagcgatgcg ccttggtcgc ccgtttcgag cttccttgcc gcttctttga 24720 
gatccatgaa gttgggttca ccggccgtgg ccaccctcgg cgtatcggtt agttcggccg 24780 
aactgtcccc tccgacggcc ccggccgatt ctgcctgcac agttccttcg ccgtcgttgt 24840 
ccagcgcggt cgcagcctcc tcatcaacct cgccatacgc cttggccgcg ttgcgcagcg 24900 
aggtcgccag acgctgccgc tctttggcac cggccgccag gtattcccgc atgttgtcgg 24 960 
cggacaatac cagctgttgg gcggcgtttt tagccgccgt gagttcgcac ggtgtgatgg 25020 
ggacatcagt cggtgggtcc gccatcgggg cctccacctc gttggccctg ttcaaaatct 25080 
cttgctgatc caccgtcacg gtctgcgact gcgtcatatc ggatcatcct ccttagtgct 25140 
atagccatta tcgtcgctaa actgaaaggt tcctgcacta atttgatgcc gcccgttcat 25200 
gccggcatcg cgaacggatc gccctacttc ggcagcgcca tctggtagcg gctttcctcg 25260 
ggtggggaaa cccggcgaat cggcagctgc cgatgccgcg gggtaccgat cacattgtgc 25320 
cgcagaatca cccggtcaat accgggatgc gggccgagat aggtcgtcgc attcggccac 25380 
gccaccttta cctcctgccc gatgtgtgcg ccgatcaacc gggcaaattc ctcgaactgt 25440 
ggcccgactg tgaccatcgc acctgccgcc gccgcacgca ccacgaactg ggtgaatgtc 25500 
tgagcgtcac ccaggttgag ggcgatgtcg acatcgtcga agggcatgta gaccgggcat 25560 
cggttcaccg tctcgccgac cagtacccca gctgacccga tcggcagctg gcagtggcgg .25620 
ttggccacca gatgctggcc ttgcagcgcg ggccgctgcc cgccaaatag gcgggcgaag 25680 
cccctgggtg tcttgggctt gtccgccgtg gtcagcaaca ccgtggactg cggggccatc 25740 
cccggcgcga cccggactct ggtgatggtg tggtccgcgc gcgccgacca ccatacatcc 25800 
ggacctccgg gcgccgcgta ggcggcagtg taggcatcgc gccccttgat catcgaccat 25860 
ttctcccgca caaagccgat gtcggtggcg tggtcgtagt catcgaagct gcggccacac 2 5920 
accgcgtcga caccatggct agccagtcga tcggcaatgc gcgtcgcgga cgccaccaaa 25980 
taccgggcca gtcctgcgac gccttcatcg cggcgctgcg ccgatttgcg ggtgcgttcc 26040 
gggtcggcgc gcagcacgat ccaggtccgg cggttcgccg gcgccgggtc tgtcccgatc 26100 
acctgctgat acagactcac cacgtccggc gctgcggtat tgccgacgcg gtagccggct 26160 
" gagacga tratr-cggcctcc aa-g tcgggac ag-tgc ac cgac a-ggagc tc c te-eae e ag te Gg-2 6 2 2 0 
gtgtccagca tgtcgtcggt gtgggcttgc ccgtcgacga tgaccgtcgg cgtgaatggt 26280 
cggggaatga gctcgattac ggcgaccaga aactcgcctt gccagcgcac cgcaacgtga 26340 
tctcctggct tcacggtggc cccgaccaca ggttctgacg aggaatccgg gggccgtcgg 26400 
cgccgccgca accacgcgta caccgccgcc acccagccgg tgatccggcg gccgtagaaa 26460 
gtgaccgtgg ccacgatgac gcccaacgag gccagcgcaa tccccgccca ccagtagcgc 26520 
gtctccaaga atgcgatgat gcatggcggg gccaacgcgg aggcaagcaa ggcgtgcccg 26580 
gtgctgaacc gcagccctaa aggatttctc atcggcggct cagcgcccgt ctagccagcg 26640 
cgcccaggcc cagggccaac gtaaggccga cggccaccaa cgccacagcc gtaatcgggc 26700 
gacgatcggg acccggctcc accaccgggg gtggaagtcg tctgacgttg tatggcgccg 26760 
aagcagggcc gggcggaatg tcccacgtca gcgcggccac cgcatcgatg acgccggcgc 26820 
cgaccaggtc gtcgaccccg cccccggggt gtctcgcggt ggcggtgatc cggtggatga 26880 
tctgcgccgg cgtcaggtcg gggaaccgct gccgaagcag ggccgccaga cccgacacat 2 694 0 
atgccgcggc aaacgaggtg ccggcgatgg gtaccggccc ctcccggcct tgcagcgcat 27000 
tcaccggttc accggtgtcg ccgagcgcga cgatgttttc tgcgggcgcg gccacgtcca 27060 
cccacggtcc -gtgcatcgag-aacgagctgg-gcatcceggt— ctggecgafea-GGgGGgaGgc-2.7J.-2.0 



ttaacaccag cggtgcgtac cacgccgggg tgacaacggt ctgcacattg ttccagccgc 27180 
gtgggtcgcc gggtgtggac gggtccggcg ccggattctg tacgcaatcg ccaccggtgt 27240 
tgccggccgc gaccaccacc accacgcctt tgacgttgac cgcatagtcg atggatgcac 273 00 
ccagtgaggt ttcatcgatc ggcctgctca ccttgtagca ggcggcttca ctgatgttga 27360 
tcacacccac gccgaggttg gcggcgtgca ccacggcgcg ggcaagactg cggatggaac 27420 
cggcggccgg ggtggcgttg gggtcattcg ggttggcttg tgagccgacc ggttcgaagg 274 80 
cctcagacgt ctgacgtagc gagagcagtc gagcgtcggg -cgcgacgccg acgaacccgt 2 754 0 
cggtgggcgc gggccggccc gcgatgatgg atgctgtgag agtcccatgg gcatcacagt 27600 
cagacaggcc gttaccggcc tggtcgacga aatcgccgcc aggttccgcc gggacccgtg 27660 
gcgaagcgtc gacaccggtg tcgatcaccg ccaccgtcac cccggccccg gtcgcgaact 27720 
tgtgggcatc ggccacgccc agatacgtgt tgctccacgg cggatcgtgg aacccggacc 27780 
ccggcagcgt ggtgggcgac gcgcacaaaa cgcgctgttc ggtaggctga tccgggcccg 27840 
tcacgtcggg cggcaacgcg cccggatcga tcggcggtgg cgtgatggcc gatgcgggcg 27900 
acgcggtgag caacgccagc gccaccgtga tcagaaagat acggtgcact cccagaacac 27960 
tccattcgtt gagattcatt gcgattcatt gagctgcgtt gctaccttgg gccacttgac 28020 
ggacctgtgt gcattttaga cgtaacggct gggcaaacaa cgctgtcacg cctgggctgg 28080 
tccgccgcgc cgaccagggc gcgtaggcgc tgtacctgga ccacgccggg actcaacggt 2 814 0 
tttgctaccg cactagccga tatgcggctg ctaccaaacg atcgcggcca tgtctcggtt 2 82 00 
gtctgagcac acgctgcgta tcgcggcatc gatgtcggtg gcggtgatga tctgcagatc 2 82 60 
ctgaaccgat accggttggc ccgcacgttt ttgcgcaacc acccgggtgt cccggaaccc 28320 
ttcggcgcgt tcgatcacgt tgcgggcgaa ccgaccgttt tgcatagcgt cgataccgtg 28380 
ctgcccacta ggggtggtgt agttacggat ggtggtgacc gcgtcgagga atacctcccg 28440 
tgcggcgtca tcgagctggc tggcgcgcgg tgtagcgtag cggtgtccaa tctcgacgat 28500 
ctccaccggc gaataagact cgaaccgcag ctttcggttg aaccggccag ccaaacccgg 2 8560 
gttcacggtg aggaattcat ccacctgatc ctcatagccg gccccgatga aacagaagtc 28620 
gaatcggtgt gtttccaatt gaaccaggag ttgattgacc gcctccatgc cgatcatgtc 28680 
cggtgttccg tcttgatgac gttcgatcag cgagtagaac tcgtccatga aaatgattcg 28740 
cccgagtgac ttttcgatca gctcgttcgt cttgggtcct gactccccga tgtagtgccc 28800 
acagaagtcc gatcggcgaa cttctcgaat ttcggggtga cgcacgatcc ccatgccggc 28860 
gtagatcttg ccgagcgctt cagcggtggt tgtcttacct gtgcctggtg gccccaccag 28920 
caacatgtgg ttggtctgcc cctccaccgg taggccgtgc tctaggcgca tcatgcgcac 28980 
ctcgagttgg tcttccagcg ccgataccgc ttgcttgacc gccgccaggc ccacctgttt 29040 
ggccagcagt tcccggccct cggctagcag ctcgccgcgc cgctgcgctg cattgtcgtc 29100 
atcgagctgg tcgcggcttt tcgccgtcga agcatcccaa cggtcggagc ggctggcgat 2 9160 
ggttcgttca tcggtaacaa tcaagcgcag gttcgggtcc gccagggctt ctttggcggc 29220 
gtcggtgagc accccgttga tggtggcctt cgacagccag atctgggcct tgtcctcctc 2 92 80 
atgcagttgc cggtacacca tcccccgcac atacgccaag tcggcgacca gcagcggaat 29340 
atcggccggt ccgatcgccg cggtgagcac gtcggcgccg aaccgctccg atgacctgct 29400 
gtgtccgatc acgtccaccc ggtccagcca gtccagggcc actcgcccct gcccgagatg 2 94 60 
ggcggcggcg tgggctgcca gcgcacaaat cgacgcggtc accgccggca tgacgatcgc 2 952 0 
ctgtggcggc agatcctcgg cggccgtcga caacacgtcg ggccatcgct gcgtgacgta 2 95 80 
catcaggaac gcccgagcca gctgatgcca ctggtagttg cgccacgaat ccaatagctc 29640 
gcggtttgct aacagggcat cggccttcgc atactccccc gcgatcgtca acgccgacga 2 9700 
cagcgccagc cccacctgag atgcgtcggt caccgtgatc ccgatggatg gtcccagctg 2 9760 
gacctcagcg gccaacgtcc ggccgatccg cgtggtctcg cggtgcagcc actcgctatg 29820 

'ngg-crgt:tga*gc~tgctta-a'gcg-^aggccagatc---gcggtca^ 
cgcgtcggcc atcgacggat cggcctcggt ggcagccaca aactcaggca acgccgccac 29940 
gcatccctgg ccattcttga tcgtcatcgc ccgatcgaaa tgccggcgcg cagtgagtaa 30000 
atcacccatc gtgtccacca ttctcgacat cgccgccgct gtcaccgcgg ttgcaacgtg 30060 
tgtctgtcac tctgtgcctc aaattccgtt ggcaacgttc taccggccta tcgacatcgt 30120 
gaccggctca aggctgacat agcggttctc cgcacggaac atttccatct caaccagcca 3 0180 
gttttgtcct gccgcaccga ctttcaccgt tgcccgatcg atttgttcga tggtcacctc 30240 
gaagccatgc cgatcgctct cggacagcga ggtaccgggt cgggcaatgg tgatgacact 30300 
ggctggccgt ggcgtgggcg aaatcgcgac atcgacaccg ctgccttcag atttgccgtc 30360 
atcgccgttc ttgcgccgcc gcacgtactc cacgacgccg acagtggtgc gcggcgcggg 3 0420 
ccgtggtgtg ccgacgatgc tcaactgcgg catgcgtacg ctggcccaac gctcttggtc 3 0480 
gcgagtgtgc acacacaccc gctcaccggc accgacgacg cgaatcacga tcctcttggc 30540 
gatcgtgtcg tccgcggcca cgaagacgcg cgacagctca ccggcgtcgg taacgggaat 3 06 00 
catcagccgg tccccgttgc tcagcttgcc aatcaacacc cccgacggtc cgatctcggt 30660 
gactagctgc gccggcaacg ggcagcgccg ctgtccgcgt aggtgtggac gtggcccgca 30720 

—catgttggcc—gcagccgcgg-cggcttgctc—accattgagc-cgacgcaaga—tcacaetggg— 30-7-80 




cggggtaggc 
atacaccgtt 
ccgcgacgaa 
gatagctttc 
ctcagccaag 
ggcaacgcgt 
ttgggtgttg 
cggcggcgta 
cgagccgaag 
cggcgcgagc 
cgtagtcgcc 
cacgcggacg 
gggacgccga 
tacgaacacc 
cgcccagccc 
gcgcgggctg 
gccattgcca 
cggtgatc 



gccggcgtcg 
acgttctgga 
atcgcctcag 
cagcgctgcg 
tctgttgcgg 
tgggcgaccg 
ccgatcaccg 
ccgatctccg 
gtgacgacgc 
tcaacgacgt 
cggtgcgctc 
ccacccccgg 
ccccgcagcc 
aaggcgacgc 
gccgacgcga 
aacccggtca 
gcacaccggt 



gtgtgcgcac 
tgacctcatc 
ccgggtacgc 
cactcccggc 
tggccagttt 
aaatggcagc 
gaagccgcat 
cgtcatacac 
tgattgaatc 
caatcacgtt 
ggccgagaag 
atcggttgtt 
ccagtaccgc 
cgaccacgac 
gcacgaacac 
gcttggacgt 
ggccactgcg 
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ggtgatggtc 
ggcacgcagc 
atacgtcgtc 
taccgcgtcc 
ggcgcgcaag 
ggccccaacg 
gatcagccac 
ccgcgggtaa 
gagttccagg 
gtcgctttct 
ttgcaccgcc 
ggccaaggta 
ccacgaccac 
agccatgacc 
tgtccacacc 
caacgcgccc 
ccgacgaacc 



gcggtgcacg 
gtccaggctt 
atccacccgg 
gaccccagcc 
ccctgacagc 
ctggtacgcc 
gtttcgcgcc 
tcgccgacgg 
tccagcgggt 
acggtcaccg 
accaccgcga 
attgggtcat 
gccggctgac 
gcacctagct 
ccggcgaccc 
tccgtagccg 
cgatagcgat 



tcgcgtccgg 
gcgagagaac 
cttcaccgcg 
ggcgatcaag 
gcagggagct 
agcgtaaagc 
gcccggcata 
tgccggttcg 
ggcgcagcaa 
acccggtgac 
caccgtcttg 
tccatttgac 
cccaccactg 
ggccgcccag 
gcctcgcact 
agccccgatt 
attgcgcgcc 



30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31808 



<210> 2 
<211> 13773 
<212> DNA 

<213> mycobacterium tuberculosis 
<220> 

<223> Complete DNA sequence of RDl Rv3 867 -3 877 
<400> 2 

atggtggacc cgccgggcaa cgacgacgac cacggtgatc tcgacgccct cgatttctcc 60 

gccgcccaca ccaacgaggc gtcgccgctg gacgccttag acgactatgc gccggtgcag 120 

accgatgacg ccgaaggcga cctggacgcc ctccatgcgc tcaccgaacg cgacgaggag 180 

ccggagctgg agttgttcac ggtgaccaac cctcaagggt cggtgtcggt ctcaaccctg 24 0 

atggacggca gaatccagca cgtcgagctg acggacaagg cgaccagcat gtccgaagcg 3 00 

cagctggccg acgagatctt cgttattgcc gatctggccc gccaaaaggc gcgggcgtcg 360 

cagtacacgt tcatggtgga gaacatcggt gaactgaccg acgaagacgc agaaggcagc 42 0 

gccctgctgc gggaattcgt ggggatgacc ctgaatctgc cgacgccgga agaggctgcc 480 

gcagccgaag ccgaagtgtt cgccacccgc tacgatgtcg actacacctc ccggtacaag 540 

gccgatgact gatcgcttgg ccagtctgtt cgaaagcgcc gtcagcatgt tgccgatgtc 600 

ggaggcgcgg tcgctagatc tgttcaccga gatcaccaac tacgacgaat ccgcttgcga 660 

cgcatggatc ggccggatcc ggtgtgggga caccgaccgg gtgacgctgt ttcgcgcctg 72 0 

gtattcgcgc cgcaatttcg gacagttgtc gggatcggtc cagatctcga tgagcacgtt 780 

aaacgccagg attgccatcg gggggctgta cggcgatatc acctacccgg tcacctcgcc 840 

gctagcgatc accatgggct ttgccgcatg cgaggcagcg caaggcaatt acgccgacgc 900 

catggaggcc ttagaggccg ccccggtcgc gggttccgag cacctggtgg cgtggatgaa 960 

~ggcggttgt'c~t'a'cggcgcgg~ccgaacgctg~ga'ccgacgtg~a 102 0 

tgggaaatgg ccggacaagt ttttggccgg cgcggccggt gtggcgcacg gggttgccgc 1080 

ggcaaacctg gccttgttca ccgaagccga acgccgactc accgaggcca acgactcgcc 1140 

cgccggtgag gcgtgtgcgc gcgccatcgc ctggtatctg gcgatggcac ggcgcagcca 1200 

gggcaacgaa agcgccgcgg tggcgctgct ggaatggtta cagaccactc accccgagcc 1260 

caaagtggct gcggcgctga aggatccctc ctaccggctg aagacgacca ccgccgaaca 13 2 0 

gatcgcatcc cgcgccgatc cctgggatcc gggcagtgtc gtgaccgaca actccggccg 1380 

ggagcggctg ctcgccgagg cccaagccga actcgaccgc caaattgggc tcacccgggt 1440 

taaaaatcag attgaacgct accgcgcggc gacgctgatg gcccgggtcc gcgccgccaa 1500 

gggtatgaag gtcgcccagc ccagcaagca catgatcttc accggaccgc ccggtaccgg 1560 

caagaccacg atcgcgcggg tggtggccaa tatcctggcc ggcttaggcg tcattgccga 162 0 

acccaaactc gtcgagacgt cgcgcaagga cttcgtcgcc gagtacgagg ggcaatcggc 1680 

ggtcaagacc gctaagacga tcgatcaggc gctgggcggg gtgcttttca tcgacgaggc 174 0 

ttatgcgctg gtgcaggaaa gagacggccg caccgatccg ttcggtcaag aggcgctgga 1800 

cacgctgctg gcgcggatgg agaacgaccg ggaccggctg gtggtgatca tcgccgggta 1860 

""ca"gct'ccga"c~atagat*cggc— tgctggaaac— ca-acgagggL ctgcggtcgc— ggttcgccarc — — l-9i 



tcgcatcgag 

cgctgctgat 

gcagttggag 

gtatgcgcgc 

cctcgatatc 

ggaggctatc 

cctcaccacc 

catcgtgcgc 

cgctcttggc 

caaaccacaa 

ttacgtgctg 

gctgggcaat 

gggccagacc 

ctcgatctgg 

gaccgcggtc 

cgaagcggtg 

cgccatagat 

gccaaccccg 

gccgccgata 

cggatcggtc 

cggcatcgcg 

gctggtcgcg 

cccctcaccg 

gtgctggtca 

ccggcatctg 

ggcgaccgtt 

caccgaatcg 

gacagccaag 

cctggtcgac 

cgctgaccct 

aagaagttca 

acgccgcccg 

tacgtgatgg 

cagctgtcgc 

ctggccggta 

gagtacctgc 

gtggcgttct 

caacggcagt 

ggtgaccagc 

gccagcgcag 

ctacgaaccc 

actatcgcga 

cacctagccg 

gacgaccccg 

gatgcggccg 



ttcgacacct 

gattcggcgt 

cagcgcatgt 

cagctggtgg 

gacaccctcg 

gccgcggtgc 

aaggttcagg 

cgggacaccc 

atcgtcgtcg 

ggcaaactcg 

ctgtccggac 

ccggccaacc 

gttggaatcc 

accctatgcg 

atcgcgatgc 

ctggtgtcct 

ctgaccgacc 

atctcggagg 

ccggcggcgg 

ttccagatcc 

caggtcaacg 

ccaccggcaa 

ctacccgatg 

tggcaacgca 

ccgatatcgc 

tacctcgacg 

atgtactaca 

tcgctgggcc 

ggtccggtgc 

agcccccgaa 

ctcccaccat 

atgacctggg 

gtggcgccat 

cgtacatgtt 

gcaccggtgg 

ggtatttggc 

tctcctacca 

ggtcccggcc 

cggcggtgga 

cacctcagcc 

atggattgat 

tcggcgggga 

tgttccaccc 

actggtcctg 

ggtccacccg 



gcgcgcgggc 
ctgaccggcg 
ttgggcaacc 
gaccggctcc 
cagcaagcca 
gacaagacgt 
caaagtgtcg 
cggctaaaga 
aaagagggcg 
gggaagtccg 
caggtgaatc 
cttccgcaca 
atgggcgagg 
^ atgaaagtcg 
cgcggtgccg 



cacacgcac'c" 
gcaaggctgg 
atcgcggctc 
ctaaccaatc 
gccgtatcgc 
cgcgggtcca 
aggagataac 
tcccgtttgg 
cggaattcgg 
aattcctgcg 
tcctgctcac 
ctgccgctgt 
tgttgaccgg 
gcgcggccgg 
acctaccccc 
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attcccccga ggaactcctc gagatcgcca acgtcattgc 
tgaccgcaga ggcggccgag aactttcttc aggccgccaa 
tgcgcggccg gcgcgccctg gacgtcgccg gcaacggtcg 
aggccagcga gcaatgccgg gacatgcgtc tagcccaggt 
acgaagaccg gcttcgcgag atcaacggct cagatatggc 
acgcacacct caacatgaga gaatgaacta tggggcttcg 
ttagcggctg gcgttttctg ctgcgccggc tcgaacacgc 
ggatgtttga cgacccgctg cagttctaca gccgctcgat 
cggtcctgat tctggcgggt gccgcgctgc tggcgtactt 
gcggcaccag cctgttcacc gaccgcgcga ccaaccagct 
agttgcatcc ggtctacaac ctgacttcgg cgcggctggt 
cggccaccgt gaagtcctcc gaactgagca agctgccgat 
ccggcgcccc ctacgccacg cctgtttcgg cgggcagcac 
acaccgtcgc ccgagccgac tccacttccc cggtagtgca 
cgttggagat cgatgcttcg atcgatccgc tccagtcaca 
accagggcga aacctggatc gtcacaacta agggacgcca 
gcgccctcac ctcgtcgatg gggataccgg tgacggccag 
gcatgttcaa cgcgctgcct gatatggggc cctggcagct 
gcgcgcccaa ttcgcttggc ctacctgatg atctagtgat 
acaccgacaa gggcccgcaa tactatgtgg tgctgcccga 
cgacaaccgc tgcggcgctg cgcgccaccc aggcgcacgg 
tggtgcccag tctggtcgtc agaatcgccg aacgggtata 
aaccgctcaa gatcgtgtcc cggccgcagg atcccgcgct 
gcgccggcga ccagtcgccg cagtcaacgg tgctgtccgg 
cctcagcgat gaacatgggg atcaagcaga tccacgggac 
gcggaaaatt cgtggcactg caatcccccg atcctcgata 
tcgatccaca gggcgtgcgt tatggggtgc ctaacgcgga 
tgagttcacc ccaaaacgcg ccctgggaga tcgttcgtct 
tgtcgaaaiga tgccgcactg ctcgagcacg acacgctgcc 
aagttcccgc cggagcctcc ggagccccct gatgacgacc 
tacccgtggc ccccggttga ccccgggcga gatcagcctc 
catcgacatc ccaccgtcgg gcgtccaaaa gatccttccc 
gctcggcatg atcgccatca tggtggccgg cggcaccagg 
gatgatgccg ctgatgatga tcgtgatgat ggtcggcggt 
tggcggcaag aaggtgcccg aaatcaacgc cgaccgcaag 
aggactacgc acccgagtga cgtcctcggc cacctctcag 
cgcaccgcat cccgaggatc tgttgtcgat cgtcggcacc 
ggccaacgcc gacttctatg cggccacccg aatcggtatc 
tcgattattg aagccggccg tcggcgggga gttggccgcc 
gttcctggag ccggtcagtc atatgtgggt ggtcaagttt 
ccatgactgc ccgaaactgc tgcaactccg tacctttccg 
cttggcgggg gcagccggcc tgatgacggc gatgatctgt 
accggacctg ctgcagatcc gggtgctcac cgaggaaccc 
gctcaaatgg cttccgcacg tacagcacca gaccgaaacc 
gctgatcttc acgcgccagg aaggtctgtc ggacctggcc 
—cgattcgctt— cccggcggcc-cctacgtagt— cgtcgtcgae- 
attcccgccc gacggtaggg ccggtgtcac ggtgatcacg 
ggcctaccgc atcagggtgc acgaggatgg gacggctgat 
gtttcgccag gtgacatcgg tcaccgatcg gatgtcgccg 
gcgaaagttg gccggatggt ccatcacggg caccatcctc 
gaagaaggtg gccaccgact ggcaccagct ggtcggtgcg 
accttcccgc tggaggatgt acaccgacac cgaccgtgac 
tcatgaacta aagaccggca acgtcatgta cctggacatc 
cgccggaccg cacggcatgc tcatcgggac cacggggtct 
caccctgatc ctgtcgctgg tggcaatgac tcatccagat 
cgacttcaaa ggtggttcaa ccttcctggg aatggaaaag 
cgtcaccaac atggccgagg aagccgagct cgtcagccgg 
agaactcgat cggcgccagt cgatcctccg acaggccggg 
agccctgtcc ggcgtggccg aatacgagaa gtaccgcgaa 
gctgccaacg cttttcgtcg tcgtcgacga gttcgccgag 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
—4680- 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 



ctgttgcaga g trc-a-cc'cgga— ctrtncartxzggg c L g t tcgacc-ggat-ebgeeg-egfeegggegg: 55^ 



tcgctgaggg 

gacaaactgg 

tccaaggcgg 

gggtttctcc 

ccatacatg.c 

cagaccacta 

gcgccgacac 

gccaacggcc 

actgctgaac 

gaatcgcgtg 

ctcatcgccc 

cgccgccatc 

attgggggcg 

gccgccacac 

ctgatctatc 

aaggtcaacc 

aaggaacacc 

cccgttgcgt 

gtcggcgagt 

ttcggcgtcc 

gactacctcg 

cggattaccc 

catctgatga 

gcgatcaccg 

cgggtcctgc 

tccgactacc 

gctcactgcc 

aagacgacca 

gtgcggttca 

ctgctgggcg 

ctggcggtca 

tcgcgttcgt 

gtgggtgccg 

gcagatatcg 

atggacaagt- 

gagaagcagg 

tttctcgtct 

gaagtgttcg 

cccgagctca 

ataaccgaac 

aaaaatgtca 

gcacggcgtg 

ggccgatgag 

ggcttccaat 

cgcccgcacc 



tccatctgct 
agccaaacct 
taatcggcac 
gggtcggcat 
cgccggcggc 
gacaagccgc 
cgtgacccgc 
cgcgccggcg 
cggaagtacg 
cgtacaagat 
gtgatcggcg 
tacaggatgt 
cacctcaaac 
actcaccgcg 
tcgaaaacct 
gggtggtcgc 
gagtgggctc 
ccgatccata 
tccccgacct 
acgtcatcat 
gcaccaagat 
gcgagatccc 
tcggcgtgcc 
cgggggtgac 
cggagcgtat 
gcactcgctg 
acatgcacac 
ttgcccacgc 
tgctcgcgga 
ccggcgcgat 
acctgaagaa 
ggtggagcgg 
ccggggggat 
ggttgcacat 
tcgtcggcgc 
aattcccatc 
cgccagacgg 
cagcaccccc 
gcccggtaat 
ggtttgtgta 
catgatccga 
acggccggct 
gtctccgccc 
gcatcggccc 
tattcgcaaa 



gctggctacc 
gacatatcga 
accggaggcg 
ggaagacccg 
aggcgtcgaa 
gcgcattcac 
gccggcgacg 
acgatgcaaa 
gacgctgcgc 
gtggctgccg 
acaacccctg 
gtggggcgta 
cgggaagtcg 
caacgttcag 
tccacacgtc 
agagatgcaa 
gatcgggatg 
cggcgacgtc 
tgaggggcag 
ctccacgcca 
cgagttccgg 
ggcgaatcgt 
caggttcgac 
gcagatcgct 
ccacctgcac 
ggagattccg 
gaacccgcac 
gatcgcgcgc 
ctaccgctcg 
caaccgcaac 
gcggttgccg 
atttgacgtc 
gccgccgatg 
cattgtcacc 
cgcattcggg 
cagtgagttc 
caaagaggtc 
aagcgccggt 
cgagttcggg 
cgggatacaa 
tcgctgccga 
cgacggcgct 
aagcggcgac 
aagaccagct 
tcgacgacgg 
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cagtcgctgc 
atcgcattgc 
cagtacatca 
gtcaagttca 
accaatggtg 
aggttcaccg 
atgcaaagcg 
gcgcagcgat 
gaggttgtgc 
ccgttgacca 
cgatttgccc 
gacgtttccg 
acgctactgc 
ttctattgca 
ggtggggtag 
gccgtcatgc 
taccggcagc 
tttctgatca 
gttcaagatc 
cgctggacag 
cttggtgacg 
ccgggtcggg 
ggcgtgcaca 
tcccagcaca 
gaactcgatc 
atcggcttgc 
ctactgatct 
gccatttgtg 
ggcctgctgg 
agcgcgtcgc 
ccgaccgacc 
gtgcttctgg 
gcaccgctgg 
tgtcagatga 
tcgggcgctc 
aaggtcaagc 
atccaggccc 
taagattatt 
caatgctgac 
atacagggag 
cattggcacg 
gacgtcggtg 
ggcgttcaca 
ccaccgtgcg 
cgccgccggc 



agaccggcgg 
gcaccaccag 
ccaacaagga 
gcaccttcta 
aagccggagg 
cggcaccggt 
cagcgatgag 
gaggaggagc 
tggaccagct 
atccggtccc 
tggggatcat 
gggccggcgg 
agacgatggt 
tcgacctagg 
ccaatcggtc 
ggcaacggga 
tgcgtgacga 
tcgacggatg 
tggccgccca 
agctgaagtc 
tcaatgaaac 
cagtgtcgat 
gcgccgataa 
ccgaacaggc 
cgaacccgcc 
gcgagacgga 
tcggtgcggc 
cccgaaacag 
acgcggtgcc 
tagacgaggc 
tgacgacggc 
tcgacgattg 
ccccgttatt 
gccaggctta 
cgacaatgtt 
ggcgcccccc 
cctacatcga 
tcattgccgg 
catcgggttt 
ggaagaagta 
caagtgagcg 
accgggctgg 
tcggagggca 
ggcgaagcgg 
gtcttcgccg 
"atgccaccgg 
gcggcggccg 
accgcgcgcc 
gcggctgcaa 
gcgatgcagg 
ctgccggaga 
ggtatcaaca 
caggcagccc 
aagctcgagc 
atcttcggaa 
acccagaccc 
ctgcagcagg 
gacgaggaag 
gctggtggat 
gcaggtgggt 
-cxrcreg-grga- 



tgttcgcatc 
ctctcatgaa 
gagcggtgtc 
catcagtggg 
gcccggtcaa 
tctcgaggag 
gaggagcggc 
ggcgcgcatg 
cggcactgct 
gctcaacgag 
ggatgaaccg 
caacatcggt 
gatgtcggcc 
tggcggcggg 
cgagcccgac 
aaccaccttc 
tccaagtcaa 
gcccggtttt 
ggggctggcg 
gcgtgttcgc 
ccagatcgac 
ggaaaagcac 
cctggtggag 
acctccggtg 
gggaccagag 
cctgacgccg 
caaatcgggc 
tccccagcag 
ggacacccat 
cgttcaagca 
gcagctacgc 
gcacatgatc 
gccggcggcg 
caaggcaacc 
cctttcgggc 
tggccaggca 
gcctccagaa 
tgtagcagga 
gtttccggct 
ggcaaatgga 
acaacgctct 
ttcccgcggg 
tccaattgct 
tccaggacgt 
aataggcccc 
agctaaatac 
cgggatggca 
tgaactctct 
cgccgatggt 
cgacggcgca 
tcgccgccaa 
cgatcccgat 
tggcaatgga 
cgatggcgtc 
tgccctcccc 
tcggccaact 
tgacgtcgtt 
ccgcgcagat 
caggccccag 
cgttgacccg 
-fgic'cggcg^cfc- 



5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
-83-40- 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 

-92^ro- 



caacacatcg 
cgcacggctg 
gacgctttcg 
gggagaagcc 
ggtctggcta 
agccgcggca 
ccacatcacc 
cgcgttgacc 
ggtctaccag 
gatccttgat 
tggcagctca 
gggtgagatg 
gttcagccag 
gggcctgctc 
cgcgggcgcg 



gagggagtga" 

atggccggcg 

gcggctctgg 

tggactggag 

caaaccgcgt 

tacacccagg 

caggccgtcc 

gagatggatt 

gccgagaccg 

cccggcgcga 

acaccggttg 

agcggcccga 

gtgggcggca 

ggcaccagtc 

ggcctgctgc 



cacgccgccg acgccccagc 



"tcaccat^gctT" 

cgggtccggc 

acgctcaggc 

gtggcagcga 

caacacaggc 

ccatggccac 

ttacggccac 

atttcatccg 

cggttaacac 

gccagagcac 

gccagttgcc 

tgcagcagct 

ccggcggcgg 

cgctgtcgaa 

gcgcggagtc 

cgaccgaaaa 



"gtggcacgca 
tccaatgctt 
cgtcgagttg 
caaggcgctt 
caagacccgt 
gacgccgtcg 
caacttcttc 
tatgtggaac 
gcttttcgag 
gacgaacccg 
gccggcggct 
gacccagccg 
caacccagcc 
ccatccgctg 
gctacctggc 
gccggtrg?nr 
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tgctgccgga tcgtcggcga cgggtggcgc cgctccggtg ggtgcgggag cgatgggcca 9300 
gggtgcgcaa tccggcggct ccaccaggcc gggtctggtc* gcgccggcac cgctcgcgca 9360 
ggagcgtgaa gaagacgacg aggacgactg ggacgaagag gacgactggt gagctcccgt 9420 
aatgacaaca gacttcccgg ccacccgggc cggaagactt gccaacattt tggcgaggaa 9480 
ggtaaagaga gaaagtagtc cagcatggca gagatgaaga ccgatgccgc taccctcgcg 954 0 
caggaggcag gtaatttcga gcggatctcc ggcgacctga aaacccagat cgaccaggtg 96 00 
gagtcgacgg caggttcgtt gcagggccag tggcgcggcg cggcggggac ggccgcccag 9660 
gccgcggtgg tgcgcttcca agaagcagcc aataagcaga agcaggaact cgacgagatc 9720 
tcgacgaata ttcgtcaggc cggcgtccaa tactcgaggg ccgacgagga gcagcagcag 97 80 
gcgctgtcct cgcaaatggg cttctgaccc gctaatacga aaagaaacgg agcaaaaaca 9840 
tgacagagca gcagtggaat ttcgcgggta tcgaggccgc ggcaagcgca atccagggaa 9900 
atgtcacgtc cattcattcc ctccttgacg aggggaagca gtccctgacc aagctcgcag 9960 

cggcctgggg cggtagcggt tcggaggcgt accagggtgt ccagcaaaaa tgggacgcca 10020 

cggctaccga gctgaacaac gcgctgcaga acctggcgcg gacgatcagc gaagccggtc 10080 

aggcaatggc ttcgaccgaa ggcaacgtca ctgggatgtt cgcatagggc aacgccgagt 10140 

tcgcgtagaa tagcgaaaca cgggatcggg cgagttcgac cttccgtcgg tctcgccctt 10200 

tctcgtgttt atacgtttga gcgcactctg agaggttgtc atggcggccg actacgacaa 102 60 

gctcttccgg ccgcacgaag gtatggaagc tccggacgat atggcagcgc agccgttctt 10320 

cgaccccagt gcttcgtttc cgccggcgcc cgcatcggca aacctaccga agcccaacgg 103 80 

ccagactccg cccccgacgt ccgacgacct gtcggagcgg ttcgtgtcgg ccccgccgcc 10440 

gccaccccca cccccacctc cgcctccgcc aactccgatg ccgatcgccg caggagagcc 10500 

gccctcgccg gaaccggccg catctaaacc acccacaccc cccatgccca tcgccggacc 10560 

cgaaccggcc ccacccaaac cacccacacc ccccatgccc atcgccggac ccgaaccggc 1062 0 

cccacccaaa ccacccacac ctccgatgcc catcgccgga cctgcaccca ccccaaccga 10680 

atcccagttg gcgcccccca gaccaccgac accacaaacg ccaaccggag cgccgcagca 10740 

accggaatca ccggcgcccc acgtaccctc gcacgggcca catcaacccc ggcgcaccgc 10800 

accagcaccg ccctgggcaa agatgccaat cggcgaaccc ccgcccgctc cgtccagacc 10860 

gtctgcgtcc ccggccgaac caccgacccg gcctgccccc caacactccc gacgtgcgcg 1092 0 

ccggggtcac cgctatcgca cagacaccga acgaaacgtc gggaaggtag caactggtcc 10980 

atccatccag gcgcggctgc gggcagagga agcatccggc gcgcagctcg cccccggaac 11040 

ggagccctcg ccagcgccgt tgggccaacc gagatcgtat ctggctccgc ccacccgccc 11100 

cgcgccgaca gaacctcccc ccagcccctc gccgcagcgc aactccggtc ggcgtgccga 11160 

gcgacgcgtc caccccgatt tagccgccca acatgccgcg gcgcaacctg attcaattac 11220 

ggccgcaacc actggcggtc gtcgccgcaa gcgtgcagcg ccggatctcg acgcgacaca 112 80 

gaaatcctta aggccggcgg ccaaggggcc gaaggtgaag aaggtgaagc cccagaaacc 11340 

gaaggccacg aagccgccca aagtggtgtc gcagcgcggc tggcgacatt gggtgcatgc 11400 

gttgacgcga atcaacctgg gcctgtcacc cgacgagaag tacgagctgg acctgcacgc 11460 

tcgagtccgc cgcaatcccc gcgggtcgta tcagatcgcc gtcgtcggtc tcaaaggtgg 1152 0 

ggctggcaaa accacgctga cagcagcgtt ggggtcgacg ttggctcagg tgcgggccga 115 8 0 

ccggatcctg gctctagacg cggatccagg cgccggaaac ctcgccgatc gggtagggcg 1164 0 

acaatcgggc gcgaccatcg ctgatgtgct tgcagaaaaa gagctgtcgc actacaacga 11700 

catccgcgca cacactagcg tcaatgcggt caatctggaa gtgctgccgg caccggaata 11760 

cagctcggcg cagcgcgcgc tcagcgacgc cgactggcat ttcatcgccg atcctgcgtc 11820 

gaggttttac aacctcgtct tggctgattg tggggccggc ttcttcgacc cgctgacccg 11880 

cggcgtgctg tccacggtgt ccggtgtcgt ggtcgtggca agtgtctcaa tcgacggcgc 11940 

acaacaggcg tcggtcgcgtf~"tggaceggt"t gcgcaacaac ggftaccaag aftftggcgag 1'2'0*0'0" 



ccgcgcatgc gtggtcatca atcacatcat gccgggagaa cccaatgtcg cagttaaaga 12 060 

cctggtgcgg catttcgaac agcaagttca acccggccgg gtcgtggtca tgccgtggga 12120 

caggcacatt gcggccggaa ccgagatttc actcgacttg ctcgacccta tctacaagcg 12180 

caaggtcctc gaattggccg cagcgctatc cgacgatttc gagagggctg gacgtcgttg 1224 0 

agcgcacctg ctgttgctgc tggtcctacc gccgcggggg caaccgctgc gcggcctgcc 12300 

accacccggg tgacgatcct gaccggcaga cggatgaccg atttggtact gccagcggcg 12360 

gtgccgatgg aaacttatat tgacgacacc gtcgcggtgc tttccgaggt gttggaagac 12420 

acgccggctg atgtactcgg cggcttcgac tttaccgcgc aaggcgtgtg ggcgttcgct 12480 

cgtcccggat cgccgccgct gaagctcgac cagtcactcg atgacgccgg ggtggtcgac 1254 0 

gggtcactgc tgactctggt gtcagtcagt cgcaccgagc gctaccgacc gttggtcgag 12600 

gatgtcatcg acgcgatcgc cgtgcttgac gagtcacctg agttcgaccg cacggcattg 12660 

aatcgctttg tgggggcggc gatcccgctt ttgaccgcgc ccgtcatcgg gatggcgatg 12720 

cgggcgtggt gggaaactgg gcgtagcttg tggtggccgt tggcgattgg catcctgggg 12780 

atcgctgtgc tggtaggcag cttcgtcgcg aacaggttct accagagcgg ccacctggcc 12 84 0 
gagtgcctac tggtcacgac gtatctgctg^artrgca-a'C'crg— ccgcagcgct ggcrcgtgccg — 12900- 
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ttgccgcgcg 
ctgtttttga 
gccgtgatca 
gactgggtcc 
ctgaccgtcg 
gacaacgagg 
acctggcagg 
aaactggcca 
ggtgccatcg 
ttgatcacga 
gcgttgctgg 
tacccgcact 
gtggtggtcg 
gaattgatcg 
gtgtacgaca 



9g9tcaactc 
ccttgatgac 
ccgctatcgc 
ccgcgggggg 
cggtcgcgcg 
agttgctcga 
ccatcatcgc 
agcaacttct 
cggtcgtggt 
ccgtctgcgg 
cggcgacggt 
atgcctggct 
ggtcgatggc 
acggcgccat 
cggtccgcaa 



gttgggggcg 
gcggggcggc 
ggtcatcgcg 
gatcgcattc 
gatcgcgctg 
tcccgtcgcg 
gtcggtgccc 
gatcggatac 
gcgcgggcac 
atttcgctcg 
cgcgattccg 
gttgttgagc 
tcacgtccgg 
gatcgctgcc 
tatccggttc 



ccacaagttg 
cctcggaagc 
gccgccgctg 
gggctgttca 
ccgccgattc 
accccggagg 
gcgtccgcgg 
gtcacgtcgg 
ttctttgtac 
cggctttacg 
acgggtctga 
gtctacctca 
cgcgtttcac 
atcattccca 
tga 



ccggcgccgc 
gtcatgagtt 
ccttcggcta 
ttgtgacgaa 
cggtacccgg 
ctaccagcga 
tccggctcac 
gcaccctgat 
acagcctggt 
ccgagcgctg 
cggccaaact 
cggtagccct 
cggtcgtaaa 
tgctgctgtg 



tacggccgtg 
ggcgtcgttt 
tggataccag 
tgcggccaag 
cgaaaccgtg 
agaaaccccg 
cgagcgcagc 
tctggctgcc 
ggtcgcgggt 
gtgtgcgtgg 
catcatctgg 
ggttgcgctc 
acgaactctg 
gatcaccggg 



12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13773 



<210> 3 
<211> 3909 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RD1-AP34 (a 3909 bp fragment of the M. 
tuberculosis H37Rv genome) 



<400> 3 

gaattcccat 

tcgccagacg 

gcagcacccc 

agcccggtaa 

cggtttgtgt 

acatgatccg 

gacggccggc 

ggtctccgcc 

tgcatcggcc 

ctattcgcaa 

ggagggagtg 

gatggccggc 

ggcggctctg 

ctggactgga 

acaaaccgcg 

atacacccag 

ccaggccgtc 

cgagatggat 



ccagtgagtt 
gcaaagaggt 
caagcgccgg 
tcgagttcgg 
acgggataca 
atcgctgccg 
tcgacggcgc 
caagcggcga 
caagaccagc 
atcgacgacg 
atcaccatgc 
gcgggtccgg 
gacgctcagg 
ggtggcagcg 
tcaacacagg 
gccatggcca 
cttacggcca 
tatttcatcc 



caaggtcaag 
catccaggcc 
ttaagattat 
gcaatgctga 
aatacaggga 
acattggcac 
tgacgtcggt 
cggcgttcac 
tccaccgtgc 
gcgccgccgg 
tgtggcacgc 
ctccaatgct 
ccgtcgagtt 
acaaggcgct 
ccaagacccg 
cgacgccgtc 
ccaacttctt 
gtatgtggaa 



cggcgccccc 
ccctacatcg 
ttcattgccg 
ccatcgggtt 
gggaagaagt 
gcaagtgagc 
gaccgggctg 
atcggagggc 
gggcgaagcg 
cgtcttcgcc 
aatgccaccg 
tgcggcggcc 
gaccgcgcgc 
tgcggctgca 
tgcgatgcag 
gctgccggag 
cggtatcaac 
ccaggcagcc 



cgctTftTtcga 
cgacgaaccc 
cgccggcggc 
tgacccagcc 
gcaacccagc 
accatccgct 
cgctacctgg 
agccggttgc 
ccgctccggt 
cgggtctggt 
gggacgaaga 
ccggaagact 
agagatgaag 
cggcgacctg 
gtggcgcggc 
■caataa^cag 



ctggccaggc 

agcctccaga 

gtgtagcagg 

tgtttccggc 

aggcaaatgg 

gacaacgctc 

gttcccgcgg 

atccaattgc 

gtccaggacg 

gaataggccc 

gagctaaata 

gcgggatggc 

ctgaactctc 

acgccgatgg 

gcgacggcgc 

atcgccgcca 

acgatcccga 

ctggcaatgg 

ccgatggcgtT" 

atgccctccc 

ctcggccaac 

gtgacgtcgt 

gccgcgcaga 

tcaggcccca 

tcgttgaccc 

atgccggcgg 

gcgatgggcc 

ccgctcgcgc 

tgagctcccg 

ttggcgagga 

ctaccctcgc 

tcgaccaggt 

cggccgccca 

tcgacgagat" 



atttctcgtc 
agaagtgttc 
acccgagctc 
tataaccgaa 
aaaaaatgtc 
tgcacggcgt 
gggccgatga 
tggcttccaa 
tcgcccgcac 
ccaacacatc 
ccgcacggct 
agacgctttc 
tgggagaagc 
tggtctggct 
aagccgcggc 
accacatcac 
tcgcgttgac 
aggtctacca 
cgatcc"t"tga 
ctggcagctc 
tgggtgagat 
tgttcagcca 
tgggcctgct 
gcgcgggcgc 
gcacgccgct 
ctgctgccgg 
agggtgcgca 
aggagcgtga 
taatgacaac 
aggtaaagag 
gcaggaggca 
ggagtcgacg 
ggccgcggtg 
"ctcgacgaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 

Tr4-o- 

1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



ggccgagacc 
tcccggcgcg 
aacaccggtt 
gagcggcccg 
ggtgggcggc 
cggcaccagt 
gggcctgctg 
gatgtctcag 
atcgtcggcg 
atccggcggc 
agaagacgac 
agacttcccg 
agaaagtagt 
ggtaatttcg 
gcaggttcgt 



gcggftaaca 
agccagagca 
ggccagttgc 
atgcagcagc 
accggcggcg 
ccgctgtcga 
cgcgcggagt 
ctgatcgaaa 
acgggtggcg 
tccaccaggc 
gaggacgact 
gccacccggg 
ccagcatggc 
agcggatctc 
tgcagggcca 



gaagctcgag 
gatcttcgga 
tacccagacc 
gctgcagcag 
cgacgaggaa 
ggctggtgga 
cgcaggtggg 
cccctcggtg 
gggtgcggga 
cgcgccggca 
ggacgactgg 
tgccaacatt 
accgatgccg 
aaaacccaga 
gcggcgggga 
aageaggaacr 



gcgcgctccc aagaagcagc 
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attcgtcagg 
tcgcaaatgg 
agcagtggaa 
ccattcattc 
gcggtagcgg 
agctgaacaa 
cttcgaccga 
atagcgaaac 
tatacgtttg 
gccgcacgaa 
tgcttcgttt 
gcccccgacg 
acccccacct 
ggaaccggcc 
cccacccaaa 
accacccaca 
ggcgcccccc 
accggcgccc 
gccctgggca 
cccggccgaa 
ccgctatcgc 
ggcgcggctg 
gccagcgccg 
agaacctccc 
ccaccccgat 
cactggcggt 
aaggccggcg 
gaagccgccc 
aatcaacctg 
ccgcaatccc 
aaccacgctg 
ggctctaga 



ccggcgtcca 
gcttctgacc 
tttcgcgggt 
cctccttgac 
ttcggaggcg 
cgcgctgcag 
aggcaacgtc 
acgggatcgg 
agcgcactct 
ggtatggaag 
ccgccggcgc 
tccgacgacc 
ccgcctccgc 
gcatctaaac 
ccacccacac 
cctccgatgc 
agaccaccga 
cacgtaccct 
aagatgccaa 
ccaccgaccc 
acagacaccg 
cgggcagagg 
ttgggccaac 
cccagcccct 
ttagccgccc 
cgtcgccgca 
gccaaggggc 
aaagtggtgt 
ggcctgtcac 
cgcgggtcgt 
acagcagcgt 



atactcgagg 
cgctaatacg 
atcgaggccg 
gaggggaagc 
taccagggtg 
aacctggcgc 
actgggatgt 
gcgagttcga 
gagaggttgt 
ctccggacga 
ccgcatcggc 
tgtcggagcg 
caactccgat 
cacccacacc 
cccccatgcc 
ccatcgccgg 
caccacaaac 
cgcacgggcc 
tcggcgaacc 
ggcctgcccc 
aacgaaacgt 
aagcatccgg 
cgagatcgta 
cgccgcagcg 
aacatgccgc 
agcgtgcagc 
cgaaggtgaa 
cgcagcgcgg 
ccgacgagaa 
atcagatcgc 
tggggtcgac 



gccgacgagg 
aaaagaaacg 
cggcaagcgc 
agtccctgac 
tccagcaaaa 
ggacgatcag 
tcgcataggg 
ccttccgtcg 
catggcggcc 
tatggcagcg 
aaacctaccg 
gttcgtgtcg 
gccgatcgcc 
ccccatgccc 
catcgccgga 
acctgcaccc 
gccaaccgga 
acatcaaccc 
cccgcccgct 
ccaacactcc 
cgggaaggta 
cgcgcagctc 
tctggctccg 
caactccggt 
ggcgcaacct 
gccggatctc 
gaaggtgaag 
ctggcgacat 
gtacgagctg 
cgtcgtcggt 
gttggctcag 



agcagcagca 
gagcaaaaac 
aatccaggga 
caagctcgca 
atgggacgcc 
cgaagccggt 
caacgccgag 
gtctcgccct 
gactacgaca 
cagccgttct 
aagcccaacg 
gccccgccgc 
gcaggagagc 
atcgccggac 
cccgaaccgg 
accccaaccg 
gcgccgcagc 
C99cgcaccg 
ccgtccagac 
cgacgtgcgc 
gcaactggtc 
gcccccggaa 
cccacccgcc 
cggcgtgccg 
gattcaatta 
gacgcgacac 
ccccagaaac 
tgggtgcatg 
gacctgcacg 
ctcaaaggtg 
gtgcgggccg 



ggcgctgtcc 
atgacagagc 
aatgtcacgt 
gcggcctggg 
acggctaccg 
caggcaatgg 
ttcgcgtaga 
ttctcgtgtt 
agctcttccg 
tcgaccccag 
gccagactcc 
cgccaccccc 
cgccctcgcc 
ccgaaccggc 
ccccacccaa 
aatcccagtt 
aaccggaatc 
caccagcacc 
cgtctgcgtc 
gccggggtca 
catccatcca 
cggagccctc 
ccgcgccgac 
agcgacgcgt 
cggccgcaac 
agaaatcctt 
cgaaggccac 
cgttgacgcg 
ctcgagtccg 
gggctggcaa 
accggatcct 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3909 



<210> 4 
<211> 324 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3 861 
<400> 4 

gtgacctggc tggctgaccc ggtcggcaac agcaggatcg cccgagcgca ggcctgcaaa 6 0 
acgtcaatct cggcgcccat cgtcgaatcc tggcgggcgc aacgcggcgc gcaatgtgga 12 0 

Ii"a'gi5'g'c~g'aga~~a'a:ti2^t 

ctgttccgca gacccctcga accagcggtc caggcggcgg ttgcgtcatg ccgattgggc 240 
agacacccgg tggtcgcgca ccgggtaacc gttgcgctcg gccagggatc gcagctggcc 3 00 
caacgcgaat gcccgcgccc ggcc 324 



5 

348 
DNA 

Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3 862c-whiB6 
<400> 5 

atgcgatacg ctttcgcggc agaggctaca acctgcaacy ctttctggag gaacgtagac 60 



<210> 
<211> 
<212> 
<213> 
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atgacagtaa ccgccctgta tgaggtcccg ctcggcgttt gcacgcaaga tcccgatcgt 120 
tggacgacga ctcccgacga cgaggccaag accctgtgcc gggcttgccc gcgccggtgg 180 
ctgtgtgcac gcgacgccgt cgagtccgcg ggtgcggaag ggctgtgggc aggggtcgta 24 0 
attcccgaat caggccgggc gcgggcattc gcgttgggcc agctgcgatc cctggccgag 3 00 
cgcaacggtt acccggtgcg cgaccaccgg gtgtctgccc aatcggca 34 8 



<210> 6 
<211> 1176 
<212> DNA 

<213> Mycobacterium tuberculosis 



<220> 

<223> DNA sequence RV3 863 



<400> 6 

atggcgggcg 

acgcagatgt 

ggcacatcag 

gtcggcaccc 

gatggacggt 

ggcagctcgg 

aaggcgttga 

caaccatcag 

cccggcgtgg 

caacgcagct 

ggccgtagtt 

gctggaacca 

gcgtcccccg 

gccgccgtag 

ttcaccgaac 

caagcaaccc 

cgccaccacg 

cgcttatatg 

ggcgtcgcag 

accgaagcga 



agcggaaagt 
ctaaagcagg 
acgtgattga 
agatcgacga 
ggcggatcgt 
tgaccgtcag 
cgttcgaagt 
cggacctgtc 
ttcgcgcagg 
tggcggccga 
ggccccggga 
tcgcgcgaat 
gactccggcc 
acggctgcag 
gtgccgcgcc 
gcatcagccg 
acgaattaat 
ccgcacggcg 
aagaaatgat 
tcgaagcact 



ctgcccaccg 
gtcgactgtc 
gccccgtcgc 
ttctcgaatc 
cggcaacatc 
cgataagacc 
cgtcaggccg 
ggacgacccg 
ggcggccgcg 
cgggatcatc 
acggacccgg 
ccgtcggggc 
tgccgacggc 
tctggctatc 
gatccttgct 
gattaccccg 
gaggctggga 
gcgcgcgaac 
cgtcggcgcc 
gatccgtcag 



tcccggctag 
ggaccggcgc 
ggtgtcgcga 
tctcagacac 
ccgagaggta 
ctaatccgat 
tcggattccg 
gcgcacaacg 
gctgcgcgcc 
aacgcgggcg 
gcaaaactcg 
gagcccaccg 
ccggcgtcct 
gcagcgttgc 
gatttgcgcc 
gaattgatca 
gcaaccgccc 
ctttccaccc 
gaagccgagg 
atcaat 



tacccgcgaa 
cgctggtcgc 
tcattggcca 
atctgcgagc 
tgttcgtcgg 
tcggcgatcc 
ctgcacagca 
ctgcgccggt 
gtcgtgaact 
cgctcatcgc 
aagaagtgct 
agcccgcaac 
tgatcgcgca 
cggcgaccga 
agctcgaggc 
aggcgttggg 
ctggtgccac 
tggagactgc 
aagagttgcc 



taagggatca 
gtgcagcggc 
ctcgtgccga 
ggtatccgat 
cggacgacgc 
ccctggaggc 
cggccgcgta 
cgcaccggac 
tgacatcagc 
gttcgagaaa 
gcagtggccc 
aaaccccgac 
ggctgtcacc 
ggaccccgag 
gattgccgtc 
cgcggtacgt 
actggcgcag 
ccaagcggcc 
agccgaggcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1176 



<210> 7 
<211> 1206 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3 864 



<400> 7 

atggcatcgg 

ttgcttggag 

aaacaaatca 

gcggaagctt 

ctcgacaaat 

gacgtcctgc 

gaaaagattc 

atcgcgatgg 

gcgaccaacc 

ttccccggcc 

aagttgcccg 

tggccgccca 

gggttcccgg 



gtagcggtct 
agggaatccc 
gcgacaaaat 
acctaaacca 
taaccggcaa 
gggccatgaa 
cgctgctcgg 
ccgttgtcgg 
tgaggggaat 
tgcccgggtt 
acattccgat 
cccccggcag 
agttccccgc 



ttgcaagacg 
cgacccaggc 
gggactcgcc 
gaacatcgcg 
catgatctcg 
gaagatgatt 
ccacttgtgg 
cggcgcattg 
tctcggcagg 
gcccagcctg 
ccccggcctg 
cccgttgttc 
catccccggg 
gccgggtctg 



acgagtaact 
gacattttca 
attccgggca 
caacaacttc 
aatcaggcca 
gacggtgtct 
tcgtgggagc 
ctctatctaa 
ctgatcgaga 
cccgacatca 
cccgacatcc 
cccgacctcc 
ttccccgcac 
ggcgacctgc 



ttatttgggg 
acaccggttc 
ccaactggat 
gcgcacaggt 
aatacgtctc 
acaaggtttg 
tcgcaatccc 
cgattatgac 
tgttgacgac 
tcgacggcct 
cgggcctacc 
cgtcgttccc 
tgcccgggtt 
tgcccggcgt 



ccagttactc 60 
gtcgctgttc 120 
cggccaagcg 180 
gatgggcgat 240 
cgatacgcgc 300 
taagggcctc 360 
tatgtccggc 420 
gctgatgaat 4 80 
cttgccaaag 540 
ctggccgccg 600 
cgacttcaaa 660 
agggttcccc 72 0 
gcccagcatt 7 80 
aggcgatttg 840 



cccaacttgt tccccggctt 




ggcaagttac ccacctggac tgagctggcc gctttgcctg acttcttggg cggcttcgcc 900 
ggcctgccca gcttgggttt tggcaatctg ctcagctttg ccagtttgcc caccgtgggt 960 
caggtgaccg ccaccatggg tcagctgcaa cagctcgtgg cggccggcgg tggccccagc 1020 
caactggcca gcatgggcag ccaacaagcg caactgatct cgtcgcaggc ccagcaagga 1080 
ggccagcagc acgccaccct cgtgagcgac aagaaggaag acgaggaagg cgtggccgag 114 0 
gcggagcgtg cacccatcga cgctggcacc gcggccagcc aacgggggca ggaggggacc 1200 
gtcctt 12 06 



<210> 8 
<211> 309 
<212> DNA 

<213> Mycobacterium tuberculosis 

<220> 

<22 3> DNA sequence Rv3 865 
<400> 8 

atgaccggat ttctcggtgt cgtgccttcg 
gagatcgtgg gtgatatcaa aagggcgacc 
cagcttaccc atggttcgtt cacgtcgaaa 
acccgtagca gcacgggcac gggtttgcag 
ctcgcagccg ccggcgccta cctcaaggcc 
attttcggt 



ttcctgaagg tgctggcggg catgcacaac 60 
gatacggtcg ccgggattag cggacgagtt 120 
ttcaatgaca cgctgcaaga gtttgagacc 180 
ggagtcacca gcggactggc caataatctg 24 0 
gacgatggcc tagccggtgt tatcgacaag 3 00 

309 



<210> 9 
<211> 849 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<22 3> DNA sequence Rv3 866 



<400> 9 

atgacgggtc 

gtaaccatcg 

acgcttggga 

gtgcagcgtg 

gtcgcggaga 

cggcgcgaca 

gtgatcgcgg 

ttggcgggca 

acgggtgtgg 

atcgcaccgg 



cgtccgctgc 
acggcatgtt 
ttcggccgaa 
acctcacagc 
tggtcgaaac 
ttggcggcgt 
cgcgcgacgg 
tggtgacagc 
caaccgagct 
cctcggcccg 



aggccgcgcg 
ggtgatcgcc 
tatcccgcaa 
gcaaggggtg 
cctgggcagg 
catggtgcgc 
cgacatgctg 
ggtgctgggg 
agccgaatgc 
cgtctatgcc 



ggcaccgccg 
gatcggttac 
gaggatctgc 
ctcgacctcc 
ccagatcgga 
ttcgtcgtgt 
gtgctgcagt 
cccgccgaac 
acaaccgcgt 
gagatcgtgg 



acaacgtggt 
acctggttga 
gagacatcgt 
acggggagcc 
ccttggaggg 
gccgcagggg 
tggtggcgcc 
ccgccaacgt 
cccaattgac 
gtaacccgac 



gagatcgftg" 

ggcgtcctgg 

ctgtacggaa 

ctagagctgc 

tcccgaggc 



ccagccaacg 
actccaagct 
gcttcctgcc 
tccctgcggg 



ccaccccggc 
cggtaggctg 
cggcactcag 
cgcttggcta 



ggcaccacga 
gtgtcgcttc 
cagaacttgg 
gatcacacct 



cgcagaccga 
cccgccgtgt 
agcgtgcgct 
cagatcacgc 



cggcgtcgag 60 
tttccctgtc 120 
ctgggaacag 180 
ccaaccgacg 24 0 
tcgctggtgg 300 
cgaccgccat 360 
gcaggtcggc 42 0 
cgaacccctg 480 
gcaatacggt 54 0 
cggctgggtg 600 
cgccgccgct 6'5"0~" 
tggaggcgac 72 0 
ggacggcttg 780 
acaagcctcc 84 0 
849 



<210> 10 

<211> 552 

<212> DNA 

<213> mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3867 



<4U0> lU 



18 



atggtggacc 
gccgcccaca 
accgatgacg 
ccggagctgg 
atggacggca 
cagctggccg 
cagtacacgt 
gccctgctgc 
gcagccgaag 
gccgatgact 



cgccgggcaa 
ccaacgaggc 
ccgaaggcga 
agttgttcac 
gaatccagca 
acgagatctt 
tcatggtgga 
gggaattcgt 
ccgaagtgtt 
ga 



cgacgacgac 
gtcgccgctg 
cctggacgcc 
ggtgaccaac 
cgtcgagctg 
cgttattgcc 
gaacatcggt 
ggggatgacc 
cgccacccgc 



cacggtgatc 
gacgccttag 
ctccatgcgc 
cctcaagggt 
acggacaagg 
gatctggccc 
gaactgaccg 
ctgaa'tctgc 
tacgatgtcg 



tcgacgccct 
acgactatgc 
tcaccgaacg 
cggtgtcggt 
cgaccagcat 
gccaaaaggc 
acgaagacgc 
cgacgccgga 
actacacctc 



cgatttctcc 
gccggtgcag 
cgacgaggag 
ctcaaccctg 
gtccgaagcg 
gcgggcgtcg 
agaaggcagc 
agaggctgcc 
ccggtacaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
552 



<210> 11 
<211> 1722 
<212> DNA 

<213> mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3868 



<400> 11 

atgactgatc 

gcgcggtcgc 

tggatcggcc 

tcgcgccgca 

gccaggattg 

gcgatcacca 

gaggccttag 

gttgtctacg 

aaatggccgg 

aacctggcct 

ggtgaggcgt 

aacgaaagcg 

gtggctgcgg 

gcatcccgcg 

cggctgctcg 

aatcagattg 

atgaaggtcg 

accacgatcg 

aaactcgtcg 

aagaccgcta 

gcgctggtgc 

ctgctggcgc 

tccgacatag 

atcgagttcg 



gcttggccag 
tagatctgtt 
ggatccggtg 
atttcggaca 
ccatcggggg 
tgggctttgc 
aggccgcccc 
gcgcggccga 
acaagttttt 
tgttcaccga 
gtgcgcgcgc 
ccgcggtggc 
cgctgaagga 
ccgatccctg 
ccgaggccca 
aacgctaccg 
cccagcccag 
cgcgggtggt 
agacgtcgcg 
agacgatcga 
aggaaagaga 
ggatggagaa 
atcggctgct 
acacctattc 



tctgttcgaa 
caccgagatc 
tggggacacc 
gttgtcggga 
gctgtacggc 
cgcatgcgag 
ggtcgcgggt 
acgctggacc 
ggccggcgcg 
agccgaacgc 
catcgcctgg 
gctgctggaa 
tccctcctac 
ggatccgggc 
agccgaactc 
cgcggcgacg 
caagcacatg 
ggccaatatc 
caaggacttc 
tcaggcgctg 
cggccgcacc 
cgaccgggac 
ggaaaccaac 
ccccgaggaa 



agcgccgtca 
accaactacg 
gaccgggtga 
tcggtccaga 
gatatcacct 
gcagcgcaag 
tccgagcacc 
gacgtgatcg 
gccggtgtgg 
cgactcaccg 
tatctggcga 
tggttacaga 
cggctgaaga 
agtgtcgtga 
gaccgccaaa 
ctgatggccc 
atcttcaccg 
ctggccggct 
gtcgccgagt 
ggcggggtgc 
gatccgttcg 
cggctggtgg 
gagggtctgc 
ctcctcgaga 



gcatgttgcc 
acgaatccgc 
cgctgtttcg 
tctcgatgag 
acccggtcac 
gcaattacgc 
tggtggcgtg 
accaggtcaa 
cgcacggggt 
aggccaacga 
tggcacggcg 
ccactcaccc 
cgaccaccgc 
ccgacaactc 
ttgggctcac 
gggtccgcgc 
gaccgcccgg 
taggcgtcat 
acgaggggca 
ttttcatcga 
gtcaagaggc 
tgatcatcgc 
ggtcgcggtt 
tcgccaacgt 



gatgtcggag 
ttgcgacgca 
cgcctggtat 
cacgttaaac 
ctcgccgcta 
cgacgccatg 
gatgaaggcg 
gagtgctggg 
tgccgcggca 
ctcgcccgcc 
cagccagggc 
cgagcccaaa 
cgaacagatc 
cggccgggag 
ccgggttaaa 
cgccaagggt 
taccggcaag 
tgccgaaccc 
atcggcggtc 
cgaggcttat 
gctggacacg 
cgggtacagc 
cgccactcgc 
cattgccgct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
TS^O- 
1560 
1620 
1680 
1722 



gctgatgatt 
ttggagcagc 
gcgcgccagc 
gatatcgaca 
gctatcgccg 



cggcgttgac 
gcatgttgcg 
tggtggaggc 
ccctcgacga 
cggtgcacgc 



cgcagaggcg 
cggccggcgc 
cagcgagcaa 
agaccggctt 
acacctcaac 



gccgagaacf" 

gccctggacg 

tgccgggaca 

cgcgagatca 

atgagagaat 



"tftc'E'tcaggc 
tcgccggcaa 
tgcgtctagc 
acggctcaga 
ga 



cgccaagcag 
cggtcggtat 
ccaggtcctc 
tatggcggag 



<210> 12 
<211> 1443 
<212> DNA 

<213> mycobacterium tuberculosis 



<220> 

<223> DNA sequence Rv3 869 



19 



atggggcttc 
ctcgaacacg 

agccgctcga 
ctggcgtact 
accaaccagc 
gcgcggctgg 
aagctgccga 
gcgggcagca 
ccggtagtgc 
ctccagtcac 
aagggacgcc 
gtgacggcca 
ccctggcagc 
gatctagtga 
gtgctgcccg 
caggcgcacg 
gaacgggtat 
gatcccgcgc 
gtgctgtccg 
atccacggga 
gatcctcgat 
cctaacgcgg 
atcgttcgtc 
gacacgctgc 
tga 



gcctcaccac 
ccatcgtgcg 

tcgctcttgg 
tcaaaccaca 
tttacgtgct 
tgctgggcaa 
tgggccagac 
cctcgatctg 
agaccgcggt 
acgaagcggt 
acgccataga 
ggccaacccc 
tgccgccgat 
tcggatcggt 
acggcatcgc 
ggctggtcgc 
acccctcacc 
tgtgctggtc 
gccggcatct 
cggcgaccgt 
acaccgaatc 
agacagccaa 
tcctggtcga 
ccgctgaccc 



caaggttcag 
ccgggacacc 
catcgtcgtc 
aggcaaactc 
gctgtccgga 
tccggccaac 
cgttggaatc 
gaccctatgc 
catcgcgatg 
gctggtgtcc 
tctgaccgac 
gatctcggag 
accggcggcg 
cttccagatc 
gcaggtcaac 
gccaccggca 
gctacccgat 
atggcaacgc 
gccgatatcg 
ttacctcgac 
gatgtactac 
gtcgctgggc 
cggtccggtg 
tagcccccga 



gttagcggct 
cggatgtttg 
gcggtcctga 
ggcggcacca 
cagttgcatc 
ccggccaccg 
cccggcgccc 
gacaccgtcg 
ccgttggaga 
taccagggcg 
cgcgccctca 
ggcatgttca 
ggcgcgccca 
cacaccgaca 
gcgacaaccg 
atggtgccca 
gaaccgctca 
agcgccggcg 
ccctcagcga 
ggcggaaa'at 
atcgatccac 
ctgagttcac 
ctgtcgaaag 
aaagttcccg 



ggcgttttct 
acgacccgct 
ttctggcggg 
gcctgttcac 
cggtctacaa 
tgaagtcctc 
cctacgccac 
cccgagccga 
tcgatgcttc 
aaacctggat 
cctcgtcgat 
acgcgctgcc 
attcgcttgg 
agggcccgca 
ctgcggcgct 
gtctggtcgt 
agatcgtgtc 
accagtcgcc 
tgaacatggg 
tcgtggcact 
agggcgtgcg 
cccaaaacgc 
atgccgcact 
ccggagcctc 



gctgcgccgg 

gcagttctac 

tgccgcgctg 

cgaccgcgcg 

cctgacttcg 

cgaactgagc 

gcctgtttcg" 

ctccacttcc 

gatcgatccg 

cgtcacaact 

ggggataccg 

tgatatgggg 

cctacctgat 

atactatgtg 

gcgcgccacc 

cagaatcgcc 

ccggccgcag 

gcagtcaacg 

gatcaagcag 

gcaatccccc 

ttatggggtg 

gccctgggag 

gctcgagcac 

cggagccccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1443 



<210> 13 
<211> 2244 

<212> DNA 

<213> mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3870 

<400> 13 

atgacgacca agaagttcac tcccaccatt acccgtggcc cccggttgac cccgggcgag 60 

atcagcctca cgccgcccga tgacctgggc atcgacatcc caccgtcggg cgtccaaaag 12 0 

atccttccct acgtgatggg tggcgccatg ctcggcatga tcgccatcat ggtggccggc 180 

ggcaccaggc agctgtcgcc gtacatgttg atgatgccgc tgatgatgat cgtgatgatg 240 

gtcggcggtc tggccggtag caccggtggt ggcggcaaga aggtgcccga aatcaacgcc 300 

gaccgcaagg agtacctgcg gtatttggca ggactacgca cccgagtgac gtcctcggcc 360 

. acctctcagg tggcgttctt ctcctaccac gcaccgcatc ccgaggatct gttgtcgatc 420 

gtcggcaccc aacggcagtg gtcccggccg gccaacgccg acttctatgc ggccacccga 480 

atcggtatcg gtgaccagcc ggcggtggat cgattattga agccggccgt cggcggggag 540 

ttggccgccg ccagcgcagc acctcagccg ttcctggagc cggccagcca cacgcgggug 600^ 

gtcaagtttc tacgaaccca tggattgatc catgactgcc cgaaactgct gcaactccgt 660 

acctttccga ctatcgcgat cggcggggac ttggcggggg cagccggcct gatgacggcg 72 0 

atgatctgtc acctagccgt gttccaccca ccggacctgc tgcagatccg ggtgctcacc 780 

gaggaacccg acgaccccga ctggtcctgg ctcaaatggc ttccgcacgt acagcaccag 84 0 

accgaaaccg atgcggccgg gtccacccgg ctgatcttca cgcgccagga aggtctgtcg 900 

gacctggccg cgcgcgggcc acacgcaccc gattcgcttc ccggcggccc ctacgtagtc 960 

gtcgtcgacc tgaccggcgg caaggctgga ttcccgcccg acggtagggc cggtgtcacg 1020 

gtgatcacgt tgggcaacca tcgcggctcg gcctaccgca tcagggtgca cgaggatggg 1080 

acggctgatg accggctccc taaccaatcg tttcgccagg tgacatcggt caccgatcgg 1140 

atgtcgccgc agcaagccag ccgtatcgcg cgaaagttgg ccggatggtc catcacgggc 1200 

accatcctcg acaagacgtc gcgggtccag aagaaggtgg ccaccgactg gcaccagctg 1260 

gtcggtgcgc aaagtgtcga ggagataaca ccttcccgct ggaggatgta caccgacacc 1320 

gaccgtgacc ggctaaagat cccgtttggt catgaactaa agaccggcaa cgtcatgtac 1380 

ctggacatca aagagggcgc ggaattcggc gccggaccgc acggcatgct catcgggacc 144 0 

acggggtctg ggaagtccga attcctgcgc accctgatcc tgtcgctggt ggeaacgact 1500 



20 



catccagatc" aggtgaatct cctgctcacc gacttcaaag gtggttcaac cttcctggga 1560 

atggaaaagc ttccgcacac tgccgctgtc gtcaccaaca tggccgagga agccgagctc 1620 

gtcagccgga tgggcgaggt gttgaccgga gaactcgatc ggcgccagtc gatcctccga 1680 

caggccggga tgaaagtcgg cgcggccgga gccctgtccg gcgtggccga atacgagaag 1740 

taccgcgaac gcggtgccga cctacccccg ctgccaacgc ttttcgtcgt cgtcgacgag 1800 

ttcgccgagc tgttgcagag tcacccggac ttcatcgggc tgttcgaccg gatctgccgc 1860 

gtcgggcggt cgctgagggt ccatctgctg ctggctaccc agtcgctgca gaccggcggt 1920 

gttcgcatcg acaaactgga gccaaacctg acatatcgaa tcgcattgcg caccaccagc 1980 

tctcatgaat ccaaggcggt aatcggcaca ccggaggcgc agtacatcac caacaaggag 2 04 0 

agcggtgtcg ggtttctccg ggtcggcatg gaagacccgg tcaagttcag caccttctac 2X00 

atcagtgggc catacatgcc gccggcggca ggcgtcgaaa ccaatggtga agccggaggg 2160 

cccggtcaac agaccactag acaagccgcg cgcattcaca ggttcaccgc ggcaccggtt 2220 

ctcgaggagg cgccgacacc gtga 2244 



<210> 14 
<211> 1776 
<212> DNA 

<213> mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3871 
<400> 14 

atgactgctg aaccggaagt acggacgctg cgcgaggttg tgctggacca gctcggcact 60 

gctgaatcgc gtgcgtacaa gatgtggctg ccgccgttga ccaatccggt cccgctcaac 120 

gagctcatcg cccgtgatcg gcgacaaccc ctgcgatttg ccctggggat catggatgaa 180 

ccgcgccgcc atctacagga tgtgtggggc gtagacgttt ccggggccgg cggcaacatc 240 

ggtattgggg gcgcacctca aaccgggaag tcgacgctac tgcagacgat ggtgatgtcg . 3 00 

gccgccgcca cacactcacc gcgcaacgtt cagttctatt gcatcgacct aggtggcggc 360 

gggctgatct atctcgaaaa ccttccacac gtcggtgggg tagccaatcg gtccgagccc 42 0 

gacaaggtca accgggtggt cgcagagatg caagccgtca tgcggcaacg ggaaaccacc 4 80 

ttcaaggaac accgagtggg ctcgatcggg atgtaccggc agctgcgtga cgatccaagt 54 0 

caacccgttg cgtccgatcc atacggcgac gtctttctga tcatcgacgg atggcccggt 600 

tttgtcggcg agttccccga ccttgagggg caggttcaag atctggccgc ccaggggctg 660 

gcgttcggcg tccacgtcat catctccacg ccacgctgga cagagctgaa gtcgcgtgtt 720 

cgcgactacc tcggcaccaa gatcgagttc cggcttggtg acgtcaatga aacccagatc 780 

gaccggatta cccgcgagat cccggcgaat cgtccgggtc gggcagtgtc gatggaaaag 84 0 

caccatctga tgatcggcgt gcccaggttc gacggcgtgc acagcgccga taacctggtg 900 

gaggcgatca ccgcgggggt gacgcagatc gcttcccagc acaccgaaca ggcacctccg 960 

gtgcgggtcc tgccggagcg tatccacctg cacgaactcg acccgaaccc gccgggacca 102 0 

gagtccgact accgcactcg ctgggagatt ccgatcggct tgcgcgagac ggacctgacg 10 80 

ccggctcact gccacatgca cacgaacccg cacctactga tcttcggtgc ggccaaatcg 1140 

ggcaagacga ccattgccca cgcgatcgcg cgcgccattt gtgcccgaaa cagtccccag 1200. 

caggtgcg gt tcatg ctcgc g gactaccg c tcgggcct gc tggacg cggt gccg gacacc 12 60 



catctgctgg gcgccggcgc gatcaaccgc aacagcgcgt cgctagacga ggccgttcaa 132 0 

gcactggcgg tcaacctgaa gaagcggttg ccgccgaccg acctgacgac ggcgcagcta 13 8 0 

cgctcgcgtt cgtggtggag cggatttgac gtcgtgcttc tggtcgacga ttggcacatg 144 0 

atcgtgggtg ccgccggggg gatgccgccg atggcaccgc tggccccgtt attgccggcg 1500 

gcggcagata tcgggttgca catcattgtc acctgtcaga tgagccaggc ttacaaggca 1560 

accatggaca agttcgtcgg cgccgcattc gggtcgggcg ctccgacaat gttcctttcg 1620 

ggcgagaagc aggaattccc atccagtgag ttcaaggtca agcggcgccc ccctggccag 1680 

gcatttctcg tctcgccaga cggcaaagag gtcatccagg ccccctacat cgagcctcca 174 0 

gaagaagtgt tcgcagcacc cccaagcgcc ggttaa 1776 



<210> 15 

<211> 297 

<212> DNA 

<213> Mycobacterium tuberculosis 



21 



<220> 

<22 3> PE coding sequence (Rv3 872) 



<400> 15 
atggaaaaaa 

gctctgcacg 
gcgggggccg 
ttgctggctt 
gacgtcgccc 



tgtcacatga 

gcgtgacggc 
atgaggtctc 
ccaatgcatc 
gcacctattc 



tccgatcgct 
cggctcgacg 
cgcccaagcg 
ggcccaagac 
gcaaatcgac 



gccgacattg 
gcgctgacgt 
gcgacggcgt 
cagctccacc 
gacggcgccg 



gcacgcaagt gagcgacaac 60 

cggtgaccgg gctggttccc 120 

tcacatcgga gggcatccaa 180 

9tgcgggcga agcggtccag 240 

ccggcgtctt cgccgaa 297 



<21p> 16 
<211> 1104 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> PPE coding sequence {Rv3873) 



<400> 16 

atgctgtggc 

ccggctccaa 

caggccgtcg 

agcgacaagg 

caggccaaga 

gccacgacgc 

gccaccaact 

atccgtatgt 

aacacgcttt 

agcacgacga 

ttgccgccgg 

cagctgaccc 

ggcggcaacc 

tcgaaccatc 

gagtcgctac 

gaaaagccgg 

ggcgccgctc 

aggccgggtc 

gactgggacg 



acgcaatgcc 
tgcttgcggc 
agttgaccgc 
cgcttgcggc 
cccgtgcgat 
cgtcgctgcc 
tcttcggtat 
ggaaccaggc 
tcgagaagct 
acccgatctt 
cggctaccca 
agccgctgca 
cagccgacga 
cgctggctgg 
ctggcgcagg 
ttgccccctc 
cggtgggtgc 
tggtcgcgcc 
aagaggacga 



accggagcta 
ggccgcggga 
gcgcctgaac 
tgcaacgccg 
gcaggcgacg 
ggagatcgcc 
caacacgatc 
agccctggca 
cgagccgatg 
cggaatgccc 
gaccctcggc 
gcaggtgacg 
ggaagccgcg 
tggatcaggc 
tgggtcgttg 
ggtgatgccg 
gggagcgatg 
ggcaccgctc 
ctgg 



aataccgcac 
tggcagacgc 
tctctgggag 
atggtggtct 
gcgcaagccg 
gccaaccaca 
ccgatcgcgt 
atggaggtct 
gcgtcgatcc 
tcccctggca 
caactgggtg 
tcgttgttca 
cagatgggcc 
cccagcgcgg 
acccgcacgc 
gcggctgctg 
ggccagggtg 
gcgcaggagc 



ggctgatggc 
tttcggcggc 
aagcctggac 
ggctacaaac 
cggcatacac 
tcacccaggc 
tgaccgagat 
accaggccga 
ttgatcccgg 
gctcaacacc 
agatgagcgg 
gccaggtggg 
tgctcggcac 
gcgcgggcct 
cgctgatgtc 
ccggatcgtc 
cgcaatccgg 
gtgaagaaga 



cggcgcgggt 
tctggacgct 
tggaggtggc 
cgcgtcaaca 
ccaggccatg 
cgtccttacg 
ggattatttc 
gaccgcggtt 
cgcgagccag 
ggttggccag 
cccgatgcag 
cggcaccggc 
cagtccgctg 
gctgcgcgcg 
tcagctgatc 
ggcgacgggt 
cggctccacc 
cgacgaggac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1104 



<210> 17 
<211> 300 
<212> DNA 

<213> Mycobacterium tuberculosis 



<220> 

<223> CFP-10 coding sequence (Rv3874) 



<400> 17 

atggcagaga 

atctccggcg 

ggccagtggc 

gcagccaata 

gtccaatact 



tgaagaccga 
acctgaaaac 
gcggcgcggc 
agcagaagca 
cgagggccga 



tgccgctacc 
ccagatcgac 
ggggacggcc 
ggaactcgac 
cgaggagcag 



ctcgcgcagg 
caggtggagt 
gcccaggccg 
gagatctcga 
cagcaggcgc 



aggcaggtaa 
cgacggcagg 
cggtggtgcg 
cgaatattcg 
tgtcctcgca 



tttcgagcgg 
ttcgttgcag 
cttccaagaa 
tcaggccggc 
aatgggcttc 



60 
120 
180 
240 
300 



<210> 18 
<211> 285 
<212> DNA 



<213> Mycobacterium tuberculosis 
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<220> 

<223> ESAT-6 coding sequence (Rv3875) 



<400> 18 

atgacagagc 

aatgtcacgt 

gcggcctggg 

acggctaccg 

caggcaatgg 



agcagtggaa tttcgcgggt atcgaggccg cggcaagcgc aatccaggga 60 

ccattcattc cctccttgac gaggggaagc agtccctgac caagctcgca 12 0 

gcggtagcgg ttcggaggcg taccagggtg tccagcaaaa atgggacgcc 180 

agctgaacaa cgcgctgcag aacctggcgc ggacgatcag cgaagccggt 240 

cttcgaccga aggcaacgtc actgggatgt tcgca 285 



<210> 19 
<211> 2001 
<212> DNA 

<213> mycobacterium tuberculosis 

<220> 

<22 3> DNA sequence Rv3 876 



<400> 19 

atggcggccg 

atggcagcgc 

aacctaccga 

ttcgtgtcgg 

ccgatcgccg 

cccatgccca 

atcgccggac 

cctgcaccca 

ccaaccggag 

catcaacccc 

ccgcccgctc 

caacactccc 

gggaaggtag 

gcgcagctcg 

ctggctccgc 

aactccggtc 

gcgcaacctg 

ccggatctcg 

aaggtgaagc 

tggcgacatt 

tacgagctgg 

gtcgtcggtc 

ttggctcagg 

ctcqccqatc 



actacgacaa 
agccgttctt 
agcccaacgg 
ccccgccgcc 
caggagagcc 
tcgccggacc 
ccgaaccggc 
ccccaaccga 
cgccgcagca 
ggcgcaccgc 
cgtccagacc 
gacgtgcgcg 
caactggtcc 
cccccggaac 
ccacccgccc 
ggcgtgccga 
attcaattac 
acgcgacaca 
cccagaaacc 
gggtgcatgc 
acctgcacgc 
tcaaaggtgg 
tgcgggccga 
gggtagggcg_ 



gctcttccgg 
cgaccccagt 
ccagactccg 
gccaccccca 
gccctcgccg 
cgaaccggcc 
cccacccaaa 
atcccagttg 
accggaatca 
accagcaccg 
gtctgcgtcc 
ccggggtcac 
atccatccag 
ggagccctcg 
cgcgccgaca 
gcgacgcgtc 
ggccgcaacc 
gaaatcctta 
gaaggccacg 
gttgacgcga 
tcgagtccgc 
ggctggcaaa 
ccggatcctg 
acaatc gggc 



ccgcacgaag 
gcttcgtttc 
cccccgacgt 
cccccacctc 
gaaccggccg 
ccacccaaac 
ccacccacac 
gcgcccccca 
ccggcgcccc 
ccctgggcaa 
ccggccgaac 
cgctatcgca 
gcgcggctgc 
ccagcgccgt 
gaacctcccc 
caccccgatt 
actggcggtc 
aggccggcgg 
aagccgccca 
atcaacctgg 
cgcaatcccc 
accacgctga 
gctctagacg 
Sc gacca tcg. 



gagctgtcgc 
gtgctgccgg 
ttcatcgccg 
ttcttcgacc 
agtgtctcaa 
ggttaccaag 
cccaatgtcg 
gtcgtggtca 
ctcgacccta 
gagagggctg 



actacaacga 
caccggaata 
atcctgcgtc 
cgctgacccg 
tcgacggcgc 
atttggcgag 
cagttaaaga 
tgccgtggga 
tctacaagcg 
gacgtcgttg 



catccgcgca 
cagctcggcg 
gaggttttac 
cggcgtgctg 
acaacaggcg 
ccgcgcatgc 
cctggtgcgg 
caggcacatt 
caaggtcctc 



cacactagcg 
cagcgcgcgc 
aacctcgtct 
tccacggtgt 
tcggtcgcgt 
gtggtcatca 
catttcgaac 
gcggccggaa 
gaattggccg 



gtatggaagc 
cgccggcgcc 
ccgacgacct 
cgcctccgcc 
catctaaacc . 
cacccacacc 
ctccgatgcc 
gaccaccgac 
acgtaccctc 
agatgccaat 
caccgacccg 
cagacaccga 
gggcagagga 
tgggccaacc 
ccagcccctc 
tagccgccca 
gtcgccgcaa 
ccaaggggcc 
aagtggtgtc 
gcctgtcacc 
gcgggtcgta 
cagcagcgtt 
cggatccagg 
_QJtg^i:_g.t,g.cJ:- 
tcaatgcggt 
tcagcgacgc 
tggctgattg 
ccggtgtcgt 
tggactggtt 
atcacatcat 
agcaagttca 
ccgagatttc 
cagcgctatc 



tccggacgat 

cgcatcggca 

gtcggagcgg 

aactccgatg 

acccacaccc 

ccccatgccc 

catcgccgga 

accacaaacg 

gcacgggcca 

cggcgaaccc 

gcctgccccc 

acgaaacgtc 

agcatccggc 

gagatcgtat 

gccgcagcgc 

acatgccgcg 

gcgtgcagcg 

gaaggtgaag 

gcagcgcggc 

cgacgagaag 

tcagatcgcc 

ggggtcgacg 

cgccggaaac 

Jtjgjc.a g a a a a a_ 

caatctggaa 

cgactggcat 

tggggccggc 

ggtcgtggca 

gcgcaacaac 

gccgggagaa 

acccggccgg 

actcgacttg 

cgacgatttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2001 



<210> 20 
<211> 1536 
<212> DNA 



<213> mycobacterium tuberculosis 
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<220> 

<223> DNA sequence Rv3877 



<400> 20 

ttgagcgcac. 

gccaccaccc 

gcggtgccga 

gacacgccgg 

gctcgtcccg 

gacgggtcac 

gaggatgtca 

ttgaatcgct 

atgcgggcgt 

gggatcgctg 

gccgagtgcc 

ccgttgccgc 

gtgctgtttt 

tttgccgtga. 

caggactggg 

aagctgaccg 

gtggacaacg 

ccgacctggc 

agcaaactgg 

gccggtgcca 

ggtttgatca 

tgggcgttgc 

tggtacccgc 

ctcgtggtgg 

ctggaattga 

ggggtgtacg 



ctgctgttgc 
gggtgacgat 
tggaaactta 
ctgatgtact 
gatcgccgcc 
tgctgactct 
tcgacgcgat 
ttgtgggggc 
ggtgggaaac 
tgctggtagg 
tactggtcac 
gcggggtcaa 
tgaccttgat 
tcaccgctat 
tccccgcggg 
tcgcggtcgc 
aggagttgct 
aggccatcat 
ccaagcaact 
tcgcggtcgt 
cgaccgtctg 
tggcggcgac 
actatgcctg 
tcgggtcgat 
tcgacggcgc 
acacggtccg 



tgctggtcct 
cctgaccggc 
tattgacgac 
cggcggcttc 
gctgaagctc 
ggtgtcagtc 
cgccgtgctt 
ggcgatcccg 
tgggcgtagc 
cagcttcgtc 
gacgtatctg 
ctcgttgggg 
gacgcggggc 
cgcggtcatc 
ggggatcgca 
gcggatcgcg 
cgatcccgtc 
cgcgtcggtg 
tctgatcgga 
ggtgcgcggg 
cggatttcgc 
ggtcgcgatt 
gctgttgttg 
ggctcacgtc 
catgatcgct 
caatatccgg 



accgccgcgg 
agacggatga 
accgtcgcgg 
gactttaccg 
gaccagtcac 
agtcgcaccg* 
gacgagtcac 
cttttgaccg 
ttgtggtggc 
gcgaacaggt 
ctgatcgcaa 
gcgccacaag 
ggccctcgga 
gcggccgccg 
ttcgggctgt 
ctgccgccga 
gcgaccccgg 
cccgcgtccg 
tacgtcacgt 
cacttctttg 
tcgcggcttt 
ccgacgggtc 
agcgtctacc 
cggcgcgttt 
gccatcattc 
ttctga 



gggcaaccgc 
ccgatttggt 
tgctttccga 
cgcaaggcgt 
tcgatgacgc 
agcgctaccg 
ctgagttcga 
cgcccgtcat 
cgttggcgat 
tctaccagag 
ccgccgcagc 
ttgccggcgc 
agcgtcatga 
ctgccttcgg 
tcattgtgac 
ttccggtacc 
aggctaccag 
cggtccggct 
cgggcaccct 
tacacagcct 
acgccgagcg 
tgacggccaa 
tcacggtagc 
caccggtcgt 
ccatgctgct 



tgcgcggcct 
actgccagcg 
ggtgttggaa 
gtgggcgttc 
cggggtggtc 
accgttggtc 
ccgcacggca 
cgggatggcg 
tggcatcctg 
cggccacctg 
gctggccgtg 
cgctacggcc 
gttggcgtcg 
ctatggatac 
gaatgcggcc 
cggcgaaacc 
cgaagaaacc 
caccgagcgc 
gattctggct 
99tggtcgcg 
ctggtgtgcg 
actcatcatc 
cctggttgcg 
aaaacgaact 
gtggatcacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1536 



<210> 21 

<211> 840 

<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<22 3> DNA sequence Rv3 878 



<400> 21 
atggctgaac 
ggcctcgttt 
gcagcaatca 



cgttggccgt 
ttccgcagcc 

acga gaccat 



cgatcccacc 
tccggcgccg 
gccaagcatc 



ggcttgagcg 
atcgcggtca 
gaatcgctgg 



cagcggccgc 
gcggaacgga 

tcagtgacg g 



gtgaaagccg 
aagaccgatc 
ggcctggctg 
acacccgtgt 
gttgttgcga 
aacgcatccc 
cagggcggca 
caagcggagc 
ccggccgagg 
cccggcgggg 
agtccgctgg 



ccctgactcg 
agtcactggg 
gcgtcgcctc 
cacaggtcac 
cggtgccgca 
ccatcgctca 
gcggcccaat 
cggtccacga 
tcgttgccgc 
gcgttcccgc 
cggcccccgt 



aacagcatcc 
aaccagtttg 
ggtcggtggt 
gacccagctc 
actcgttcag 
gacgatcagt 
gcccgcacag 
agtgacaaac 
ggcacgtgac 
gcaagccatg 
cgatccgtcg 



aacatgaacg 
agccagtatg 
cagccaagtc 
ggcgagacgg 
ctggctccgc 
caaaccgccc 
cttgccagcg 
gacgatcagg 
gaaggcgccg 
gataccggag 
actccggcac 



cggcggcgga 
cattcggctc 
aggctaccca 
ccgctgagct 
acgccgttca 
aacaggccgc 
ctgaaaaacc 
gcgaccaggg 
gcgcatcacc 
ccggtgcccg 
cctcaacaac 



gaaattggcc 
ttcggtggta 
gctgcccg gc 
cgtctatgcg 
gtcgggcgaa 
gctgctgagc 
ggcaccccgt 
gatgtcgcaa 
ccagagcgcg 
ggccaccgag 
cgacgtgcag 
gggccagcag 
cccagcggcg 
cacaacgttg 



60 

120 

180 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



<210> 22 
<211> 2187 



<212> DNA 




<213> Mycobacterium tiiberculosis 

<220> 

<223> DNA sequence Rv387 9c 
<400> 22 

atgagtatta ccaggccgac gggcagctat 
gtggaagccg atgaagacac tttctatgac 
agggtcaccg atgtattgga. cacctgccgc 
ctatggtccg gcggcgccgc caatgctgcc 
ttgatgacgc tgcaggatta tctcgccacg 
ttgattgagc aagctaaatc cgatatcggc 
gatatcctgg agaatgaccc tagcctggat 
ttggtcacgg cgacgcatgg ggccaatgtc 
ctggaatcca agaattggaa acctccgaag 
tcgccgccac ccccagacgt gcctaccctg 
ccgggaaccc cgatcacccc gggaaccccg 
ccgggagcgc cggtaactcc gatcacacca 
ccgggcaagc cggtcacccc ggtgaccccg 
ccgatcacgc cggtcacccc cccggtcgcc 
gttaccccag ctcccgctcc acacccgcag 
ccccagccgg ttacaccggc cactcccggt 
gggggcgagc cggcgccgca cgtcaaaccc 
ggccagcatg cgggcggggg gacgcagtcg 
tcggtgacgc cggctgcggc gtccggtgtc 
agcggtaccg ccgtgggagc gggcgcgcgt 
gcggggtcgc atgctgccac tgggcgggcg 
ccgagcacgc gggcggcctc ggcgcggacg. 
cacatcgaca aacccgatcg cagcgagtct 
ccggtgtcgg cggctcgggc ggcacgcgac 
cgtggccgcg gtgatgcgct gcggttggcg 
gacaacaacg cgggcgacta cgggttcttc 
atcgtcgtgg ccaacagcta tgggctggcc 
aaggtgtact tggccagcgc ggatcacgca 
acctacccgg ttttggccgt gcaagcctgg 
gtgatcggta ccgcggagca gttggccagt 
gagccagatg acattccgga gagcggcaaa 
gacccctcgg cggcggctca gctggccgac 
ccgccggcgc cggtggatgt caatccaccg 
ctgatgaagc ccatgaccag caccgctacc 
cgggcctacg ctgcccactc acaggagatt 
gcggccgtcc agcgtgtggc cgtcgcggac 
ctcgaccggg ccctggccgc cgcatgc 



24 




gccagacaga tgctggatcc gggcggctgg 60 
cgggcccagg aatatagcca ggttttgcaa 12 0 
cagcagaaag gccacgtctt cgaaggcggc 180 
aacggcgccc tgggtgcaaa catcaatcaa 240 
gtgattacct ggcacaggca tattgccggg 300 
aataatgtgg atggcgctca acgggagatc 360 
gctgatgagc gccataccgc catcaattca 420 
agtctggtcg ccgagaccgc tgagcgggtg 4 80 
aacgcactcg aggatttgct tcagcagaag 54 0 
gtcgtgccat ccccgggcac accgggcaca 600 
atcaccccgg gaaccccaat cacacccatc 660 
acgcccggca ctcccgtcac gccggtgacc 720 
gtcaaaccgg gcacaccagg cgagccaacc 7 80 
ccggccacac cggcaacccc ggccacgccc 840 
ccggctccgg caccggcgcc atcgcctggg 900 
ccgtctggtc cagcaacacc gggcacccca 960 
gcggcgttgg cggagcaacc tggtgtgccg 1020 
gggcctgccc atgcggacga atccgccgcg 1080 
ccgggcgcac gggcggcggc cgccgcgccg 114 0 
tcgagcgtgg gtacggccgc ggcctcgggc 1200 
ccggtggcta cctcggacaa ggcggcggca 1260 
gcacctcctg cccgcccgcc gtcgaccgat 1320 
gcagatgacg gtacgccggt gtcgatgatc 1380 
gccgccactg cagctgccag cgcccgccag 1440 
cgacgcatcg cggcggcgct caacgcgtcc 1500 
tggatcaccg cggtgaccac cgacggttcc 1560 
tacatacccg acgggatgga attgccgaat 1620 
atcccggttg acgaaattgc acgctgtgcc 1680 
gcggctttcc acgacatgac gctgcgggcg 1740 
tcggatcccg gtgtggccaa gattgtgctg 1800 
atgacgggcc ggtcgcggct ggaggtcgtc 186 0 
actaccgatc agcgtttgct cgacttgttg 1920 
ggcgatgagc ggcacatgct gtggttcgag 1980 
ggccgcgagg ccgctcatct gcgggcgttc 204 0 
gccctgcacc aagcgcacac tgcgactgac 2100 
tggctgtact ggcaatacgt caccgggttg 2160 

2187 



<210> 23 
<211> 345 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3880c 
<400> 23 

gtgagcatgg acgaattgga cccgcatgtc gcccgggcgt tgacgctggc ggcgcggttt 60 
cagtcggccc tagacgggac gctcaatcag atgaacaacg gatccttccg cgccaccgac 120 
gaagccgaga ccgtcgaagt gacgatcaat gggcaccagt ggctcaccgg cctgcgcatc 180 
gaagatggtt tgctgaagaa gctgggtgcc gaggcggtgg ctcagcgggt caacgaggcg 240 
ctgcacaatg cgcaggccgc ggcgtccgcg tataacgacg cggcgggcga gcagctgacc 3 00 
qctqcqttat cqcrccatqtc ccgcgcqatg aacgaaggaa tggcc 345 



25 



<210> 24 
<211> 1380 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3881c 
<400> 24 > 

atgacgcagt cgcagaccgt gacggtggat cagcaagaga ttttgaacag ggccaacgag 60 
gtggaggccc cgatggcgga cccaccgact gatgtcccca tcacaccgtg cgaactcacg 120 
gcggctaaaa acgccgccca acagctggta ttgtccgccg acaacatgcg ggaatacctg 180 
gcggccggtg ccaaagagcg gcagcgtctg gcgacctcgc tgcgcaacgc ggccaaggcg 240* 
tatggcgagg ttgatgagga ggctgcgacc gcgctggaca acgacggcga aggaactgtg 300 
caggcagaat cggccggggc cgtcggaggg gacagttcgg ccgaactaac cgatacgccg 360 
agggtggcca cggccggtga acccaacttc atggatctca aagaagcggc aaggaagctc 420 
gaaacgggcg accaaggcgc atcgctcgcg cactttgcgg atgggtggaa cactttcaac 4 80 
ctgacgctgc aaggcgacgt caagcggttc cgggggtttg acaactggga aggcgatgcg 54 0 
gctaccgctt gcgaggcttc gctcgatcaa caacggcaat ggatactcca catggccaaa 600 
ttgagcgctg cgatggccaa gcaggctcaa tatgtcgcgc agctgcacgt gtgggctagg 660 
cgggaacatc cgacttatga agacatagtc gggctcgaac ggctttacgc ggaaaaccct 720 
tcggcccgcg accaaattct cccggtgtac gcggagtatc agcagaggtc ggagaaggtg 780 
ctgaccgaat acaacaacaa ggcagccctg gaaccggtaa acccgccgaa gcctcccccc 840 
gccatcaaga tcgacccgcc cccgcctccg caagagcagg gattgatccc tggcttcctg 900 
atgccgccgt ctgacggctc cggtgtgact cccggtaccg ggatgccagc cgcaccgatg 960 
gttccgccta ccggatcgcc gggtggtggc ctcccggctg acacggcggc gcagctgacg 1020 
tcggctgggc gggaagccgc agcgctgtcg ggcgacgtgg cggtcaaagc ggcatcgctc 1080 
ggtggcggtg gaggcggcgg ggtgccgtcg gcgccgttgg gatccgcgat cgggggcgcc 1140 
gaatcggtgc ggcccgctgg cgctggtgac attgccggct taggccaggg aagggccggc 12 00 
ggcggcgccg cgctgggcgg cggtggcatg ggaatgccga tgggtgccgc gcatcaggga 1260 
caagggggcg ccaagtccaa gggttctcag caggaagacg aggcgctcta caccgaggat 1320 
cgggcatgga ccgaggccgt cattggtaac cgtcggcgcc aggacagtaa ggagtcgaag 13 80 



<210> 25 
<211> 1386 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3882c 

<400> 25 

atgagaaatc ctttagggct gcgg ttcagc accg g gcacg ccttgctt QC ctccgcgttg 60 



gccccgccat gcatcatcgc attcttggag acgcgctact ggtgggcggg gattgcgctg 120 

gcctcgttgg gcgtcatcgt ggccacggtc actttctacg gccgccggat caccggctgg 180 

gtggcggcgg tgtacgcgtg gttgcggcgg cgccgacggc ccccggattc ctcgtcagaa 240 

cctgtggtcg gggccaccgt gaagccagga gatcacgttg cggtgcgctg gcaaggcgag 300 

tttctggtcg ccgtaatcga gctcattccc cgaccattca cgccgacggt catcgtcgac 360 

gggcaagccc acaccgacga catgctggac accggactgg tggaggagct cctgtcggtg 42 0 

cactgtcccg acttggaggc cgatatcgtc tcagccggct accgcgtcgg caataccgca 480 

gcgccggacg tggtgagtct gtatcagcag gtgatcggga cagacccggc gccggcgaac 540 

cgccggacct ggatcgtgct gcgcgccgac ccggaacgca cccgcaaatc ggcgcagcgc 600 

cgcgatgaag gcgtcgcagg actggcccgg tatttggtgg cgtccgcgac gcgcattgcc 660 

gatcgactgg ctagccatgg tgtcgacgcg gtgtgtggcc gcagcttcga tgactacgac 72 0 

cacgccaccg acatcggctt tgtgcgggag aaatggtcga tgatcaaggg gcgcgatgcc 780 

tacactgccg cctacgcggc gcccggaggt ccggatgtat ggtggtcggc gcgcgcggac 84 0 

cacaccatca ccagagtccg ggtcgcgccg gggatggccc cgcagtccac ggtgttgctg 900 

accacgqcqq acaaQcccaa gacacccagQ ggcttcgccc gcctatttqg cgggcagcQg 960 

cccgcgctgc aaggccagca tctggtggcc aaccgccact gccagctgcc gatcgggtca 1020 




gctggggtac tggtcggcga gacggtgaac cgatgcccgg tctacatgcc cttcgacgat 1080 
gtcgacatcg ccctcaacct gggtgacgct cagacattca cccagttcgt ggtgcgtgcg 1140 
gcggcggcag gtgcgatggt cacagtcggg ccacagttcg aggaatttgc ccggttgatc 1200 
ggcgcacaca tcgggcagga ggtaaaggtg gcgtggccga atgcgacgac ctatctcggc 1260 
ccgcatcccg gtattgaccg ggtgattctg cggcacaatg tgatcggtac cccgcggcat 1320 
cggcagctgc cgattcgccg ggtttcccca cccgaggaaa gccgctacca gatggcgctg 13 80- 
ccgaag 1386 

<210> 26 
<211> 1338 
<212> DNA 

<213> Mycobacterium txiberculosis 
<220> 

<223> DNA sequence Rv3883c 



<400> 26 

gtgcaccgta tctttctgat cacggtggcg ctggcgttgc tcaccgcgtc gcccgcatcg 60 
gccatcacgc caccgccgat cgatccgggc gcgttgccgc ccgacgtgac gggcccggat 120 
cagcctaccg aacagcgcgt tttgtgcgcg tcgcccacca cgctgccggg gtccgggttc 180 
cacgatccgc cgtggagcaa cacgtatctg ggcgtggccg atgcccacaa gttcgcgacc 240 
ggggccgggg tgacggtggc ggtgatcgac accggtgtcg acgcttcgcc acgggtcccg 3 00 
gcggaacctg gcggcgattt cgtcgaccag gccggtaacg gcctgtctga ctgtgatgcc 3 60 
catgggactc tcacagcatc catcatcgcg ggccggcccg cgcccaccga cgggttcgtc 420 
ggcgtcgcgc ccgacgctcg actgctctcg ctacgtcaga cgtctgaggc cttcgaaccg 480 
gtcggctcac aagccaaccc gaatgacccc aacgccaccc cggccgccgg ttccatccgc 540 
agtcttgccc gcgccgtggt gcacgccgcc aacctcggcg tgggtgtgat caacatcagt 600 
gaagccgcct gctacaaggt gagcaggccg atcgatgaaa cctcactggg tgcatccatc 660 
gactatgcgg tcaacgtcaa aggcgtggtg gtggtggtcg cggccggcaa caccggtggc 72 0 
gattgcgtac agaatccggc gccggacccg tccacacccg gcgacccacg cggctggaac 7 80 
aatgtgcaga ccgttgtcac cccggcgtgg tacgcaccgc tggtgttaag cgtcggcggt 84 0 
atcggccaga ccgggatgcc cagctcgttc tcgatgcacg gaccgtgggt ggacgtggcc 900 
gcgcccgcag aaaacatcgt cgcgctcggc gacaccggtg aaccggtgaa tgcgctgcaa 960 
ggccgggagg ggccggtacc catcgccggc acctcgtttg ccgcggcata tgtgtcgggt 1020 
ctggcggccc tgcttcggca gcggttcccc gacctgacgc cggcgcagat catccaccgg 1080 
atcaccgcca ccgcgagaca ccccgggggc ggggtcgacg acctggtcgg cgccggcgtc 114 0 
atcgatgcgg tggccgcgct gacgtgggac attccgcccg gccctgcttc ggcgccatac 1200 
aacgtcagac gacttccacc cccggtggtg gagccgggtc ccgatcgtcg cccgattacg 1260 
gctgtggcgt tggtggccgt cggccttacg ttggccctgg gcctgggcgc gctggctaga 13 2 0 
cgggcgctga gccgccga 13 3 8 



<210> 27 

<211> 1857 

<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3884c 



<400> 27 

atgtcgagaa 

gcgatgacga 

accgaggccg 

gatctggcct 

cggatcggcc 

accgacgcat 

gcgaaggccg 

tqqcatcaqc 



tggtggacac 
tcaagaatgg 
atccgtcgat 
cgcttaagca 
ggacgttggc 
ctcaggtggg 
atgccctgtt 
tqqctcqqqc 



gatgggtgat 
ccagggatgc 
ggccgacgcg 
gctcaacgcc 
cgctgaggtc 
gctggcgctg 
agcaaaccgc 
qttcctqatq 



ttactcactg 
gtggcggcgt 
tggctgggtc 
catagcgagt 
cagctgggac 
tcgtcggcgt 
gagctattgg 
tacqtcacqc 



cgcgccggca 
tgcctgagtt 
gtatcgcctg 
ggctgcaccg 
catccatcgg 
tgacgatcgc 
attcgtggcg 
aqcqatggcc 



tttcgatcgg 60 
tgtggctgcc 120 
cggtgaccgc 180 
cgagaccacg 240 
gatcacggtg 300 
gggggagtat 3 60 
caactaccag 420 
cqacgtqttq 480 



tcgacggccg ccgaggatct gccgccacag gcgatcgtca tgccggcggt gaccgcgtcg 54 0 



27 



atttgtgcgc 
tggctggacc 
gtgctcaccg 
gtgcggggga 
aaggccacca 
ctgcgcttga 
tcgacggcga 
ctgctagccg 
gcggtatcgg 
ccggtggagg 
accaccgctg 
attcgagaag 
acgaacgagc 
ctgatcgaac 
ctcctggttc 
gatcaggtgg 
ctgcggttcg 
ccgcgcgcca 
atccgtaact 
ttcgcccgca 
aaacgtgcgg 
gatgccgcga 



tggcagccca 
gggtggacgt 
cggcgatcgg 
tggtgtaccg 
tcaacggggt 
ttgttaccga 
aaagccgcga 
agggccggga 
cgctggaaga 
ggcagaccaa 
aagcgctcgg 
ttcgccgatc 
tgatcgaaaa 
gtcatcaaga 
aattggaaac 
atgaattcct 
agtcttattc 
gccagctcga 
acaccacccc 
acgtgatcga 
gccaaccggt 
tacgcagcgt 



cgccgccgcc 
gatcggacac 
accggccgat 
gcaactgcat 
gctcaccgac 
tgaacgaacc 
ccagctcgat 
actgctggcc 
ccaactcgag 
ccacatgttg 
caagatctac 
ggacttctgt 
gtcactcggg 
cggaacaccg 
acaccgattc 
caccgtgaac 
gccggtggag 
tgacgccgca 
tagtgggcag 
acgcgccgaa 
atcggttcag 
gtgctcagac 



catctcgggc 
agcaggtcat 
attccgctgc 
gaggaggaca 
gccgccaaag 
atcgccagcc 
gacgacaatg 
aaacaggtgg 
gtgcgcatga 
ctggtggggc 
gccggcatgg 
gggcactaca 
cgaatcattt 
gacatgatcg 
gacttctgtt 
ccgggtttgg 
atcgtcgaga 
cgggaggtat 
cacggtatcg 
gggttccggg 
gatctgcaga 
aaccgagaca 



aggggcgagt 
cggagcggtt 
tggtcgccga 
aggcccagat 
aagccctggc 
gctccgaccg 
cagcgcagcg 
gcctggcggc 
tgcgcctaga 
caccaggcac 
ggatcgtgcg 
tcggggagtc 
tcatggacga 
gcatggaggc 
tcatcggggc 
ctggccggtt 
ttggacaccg 
tcctcgacgc 
acgctatgca 
acacccgggt 
tcatcaccgc 
tggccgcgat 



ggccctggac 
cggcgccgac 
cttggcgtat 
ctggctgtcg 
ggacccgaac 
ttgggatgct 
gcgcggcgag 
ggtcaagcaa 
gcacggccta 
aggtaagaca 
tcaccccgaa 
aggacccaag 
gttctactcg 
ggtcaatcaa 
cggctatgag 
caaccgaaag 
ctacgctaca 
ggtcaccacc 
aaacggtcgg 
ggttgcgcaa 
caccgacatc 
cgtttgg 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1857 



<210> 28 
<211> 1611 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3885c 



<400> 28 

ttgacgtcca 

acagtgttcg 

gctgtcgtgg 

tgggcggtac 

gccaacaacc 

caacttctcg 

gacaacgtga 

ctcgattcaa 

cgggtgtatg 

atcatgcggc 

gccgctgcca 

Qccaa actgg 



agctgaccgg 
tgctcgcgtc 
tcggcgtcgc 
tggggctgcg 
gatccggggg 
gccgagcgca 
ttgacgtcgt 
tcagcgtcgt 
acgcggagat 
ttccggtgat 
tttcggtcgc 
ccaccgcaac 



gttcagcccg 
ggcgggctgg 
cttggtgttc 
gggtcggcgt 
tggcgtccgc 
ccgggcgact 
tgagctcgcg 
caccttcggc 
cggtacgccg 
cggcaacacc 
ccaacgcgtt 
agacttggct 



cgcagtgcga 
gcgctgggcg 
gtacagtggt 
cccgtcaaat 
gtgcaagacg 
acggtcaccg 
ccgttgctgc 
tcgcgaaccg 
ccgtatgccg 
caagctttac 
gccagctccc 
gagcttgatc 



ggcgggtcgc 
gccagctagg 
ggggtcagcc 
ggaatgaccc 
gtgtcgcggt 
ggtcggtgac 
gccacccgct 
gcaccgtcgg 
ggcggcgcga 
gctggcgtac 
tgcgctgtca 
gcQ ggg.tggg- 



cggggtgtgg 
tgcggtcatg 
ggcgtggtcg 
aattaccttg 
ggtggcggtg 
cgtagaaagc 
ggacctggaa 
cgattacccg 
aacgtggctg 
cagcgttggg 
gggcttgcgc 
gtc gg acgcg . 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
JI2_0_ 



gtagccggga 
tatgcgtacc 
gccgatgagg 
accgtgcgca 
ggtgagcaag 
cagcggcgct 
attggcaagc 
tcgcgcgtct 
ggtgccggtg 
atgccgcagt 
gtggagtacg 
gatgtcgcga 
acctcgctgt 
gcaacggtga 
gcgqaqaacc 



gtgcgcagcg 
cggctgaggc 
tcatccagaa 
caccgacgcc 
ccgccgcggc 
gcccgttgcc 
tgagcaacgg 
tcgtggccgc 
agcgggtgtg 
tgagcatcgt 
tgcggcggcg 
tttcgcccac 
ccgagagcga 
aagtcggtgc 
qctatcrtcaq 



ctggaaagct 
gatttcgtcg 
cgtaacggtg 
ggcgcctacc 
tgcggccaac 
ggcgcagcta 
ggaccggctg 
ggacgacacg 
tgtgcacact 
cggcacacca 
caagaacggc 
gccacggcca 
tcggcatggc 
ggcaggacaa 
ccttqaqccq 



atccgcggtg 
cgggttctct 
tatccggacg 
ccgcccagtg 
atgtgcgggc 
gtcaccgaga 
atgattcccg 
atcgccaaga 
cgcgaccaag 
cggcccgcgc 
gatgacggca 
gccagtgtca 
ttcgaggtga 
aactggctgg 
qtcacqatqt 



aagccgggtg 
cgcaagcctg 
cgacgtgcac 
tgatcttgcg 
cacgtccaca 
tcggaccgtc 
ttaccgacgc 
ggatcgtgat 
agcgttgggc 
cgcgcaccac 
aatctgaagg 
tcaccattgc 
ccatcgaaca 
ttgagatgga 
cqataqqccq 



gatgacgacg 
gacgctgcgt 
cgcgaccatc 
tcggctcaat 
cctacgcgga 
gggggtgttg 
cggtgagctg 
tcgcgtcgtc 
cagcgtacgc 
tgtcggcgtc 
cagcggtgtc 
ccgacccggt 
aatcgatcgg 
aatgttbcgt 
S 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1611 



28 



<210> 29 

<211> 620 

<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> CFP-10 + ESAT-6 



<400> 29 

atggcagaga 

atctccggcg 

ggccagtggc 

gcagccaata 

gtccaatact 

tgacccgcta 

cgggtatcga 

ttgacgaggg 

aggcgtacca 

tgcagaacct 

acgtcactgg 



tgaagaccga 
acctgaaaac 
gcggcgcggc 
agcagaagca 
cgagggccga 
atacgaaaag 
ggccgcggca 
gaagcagtcc 
gggtgtccag 
ggcgcggacg 
gatgttcgca 



tgccgctacc 
ccagatcgac 
ggggacggcc 
ggaactcgac 
cgaggagcag 
aaacggagca 
agcgcaatcc 
ctgaccaagc 
caaaaatggg 
atcagcgaag 



ctcgcgcagg 
caggtggagt 
gcccaggccg 
gagatctcga 
cagcaggcgc 
aaaacatgac 
agggaaatgt 
tcgcagcggc 
acgccacggc 
ccggtcaggc 



aggcaggtaa 
cgacggcagg 
cggtggtgcg 
cgaatattcg 
tgtcctcgca 
agagcagcag 
cacgtccatt 
ctggggcggt 
taccgagctg 
aatggcttcg 



tttcgagcgg 
ttcgttgcag 
cttccaagaa 
tcaggccggc 
aatgggcttc 
tggaatttcg 
cattccctcc 
agcggttcgg 
aacaacgcgc 
accgaaggca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
620 



<210> 30 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer SP6-BAC1 



<400> 30 

agttagctca ctcattaggc a 



21 



<210> 31 

<211> 21 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer T7-BAC1 



<400> 31 
_gg^^tgct_g 



:!g att a 



<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer esat-6F 



<400> .32 ■ - • 

gtcacgtcca ttcattccct 



20 



<210> 33 



<211> 19 




<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer esat-6R 

<400> 33 ^ ' . 

atcccagtga cgttgcctt 



<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer RDl' 
flanking region F 

<400> 34 

gcagtgcaaa ggtgcagata 



<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer RD1"^*= 
flanking region R 

<400> 35 

gattgagaca cttgccacga 2 0 



<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
plcA.int.F 

<400> 36 

caagttgggt ctggtcgaat 2 0 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
plcA.int.R 



<400> 37 

gctacccaag gtctcctggt 



20 



30 



<210> 38 
<211> 153 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<22 3> Sequences at the junction RDl"^*= 
<400> 38 

caagacgagg ttgtaaaacc tcgacgcagg atcggcgatg aaatgccagt cggcgtcgct 60 
gagcgcgcgc tgcgccgagt cccattttgt cgctgatttg tttgaacagc gacgaaccgg 12 0 
tgttgaaaat gtcgcctggg tcggggattc cct 153 



<210> 39 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer RDS"^'^ 
flanking region F 

<400> 39 

gaatgccgac gtcatatcg 19 



<210> 40 

<211> 20 

<212> DNA 

<213> Artificial 



Secfuence 



<220> 

<223> Description of Artificial Sequence: Primer RD5 
flanking region R 

<400> 40 

cggccactga gttcgattat 



<210> 41 
<211> 152 
<212> DNA 

<213> Mycobacterium tuberculosis 

<220> 

<223> Sequence at the junction RD5"^'' 
<400> 41 

cctcgatgaa ccacctgaca tgaccccatc ctttccaaga actggagtct ccggacatgc 
cggggcggtt cactgcccca ggtgtcctgg gtcgttccgt tgaccgtcga gtccgaacat 
ccgtcattcc cggtggcagt cggtgcggtg ac 



<210> 42 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 



31 




<223> Description of Artificial Sequence: Primer MiDl 
flanking region F 



<400> 42 

cagccaacac caagtagacg 



20 



<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer MiDl 
flanking region R 

<400> 43 

tctacctgca gtcgcttgtg 2 0 



<210> 44 
<211> 123 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<22 3> Sequence at the junction MiDl 
<400> 44 . 

cacctgacat gaccccatcc tttccaagaa ctggagtctc cggacatgcc ggggcggttc 60 
agggacattc atgtccatct tctggcagat cagcagatcg cttgttctca gtgcaggtga 12 0 
gtc 123 



<210> 45 
<211> 20 
<212> DNA 

<213> Artificial Sequence ' 
<220> 

<223> Description of Artificial Sequence: Primer MiD2 
flanking region R 

<400> 45 

■gtccatcgag gatgtcgagt . 2-a 



<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer MiD2 
flanking region L 

<400> 46 

ctaggccatt ccgttgtctg 2 0 



<210> 47 



32 



<211> 151 
<212> DNA 

<213> Mycobacterium txiberculosis 
<220> 

<22 3> Sequence at .the junction MiD2 
<400> 47 

gctgcctact acgctcaacg ccagagacca gccgccggct gaggtctcag atcagagagt 60 
ctccggactc accggggcgg ttcataaagg cttcgagacc ggacgggctg taggttcctc 120 
aactgtgtgg cggatggtct gagcacttaa c 151 



<210> 48 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer MiD3 
flanking region R 



<210> 49 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer MiD3 
flanking region L 

<400> 49 

aactgtcggg cttgctctt 19 



<210> 50 
<211> 181 
<212> DNA 

<213> Mycobacterium tuberculosis 

^y.^.n^ 

<223> Sequence at the junction MiD3 



<400> 48 

ggcgacgcca tttcc 



15 



<400> 50 



tggcgccggc 
cggccgaacc 
cgagctacct 

g 



acctccgttg ccaccgttgc cgccgctggt gggcgcggtg ccgttcgccc 
gttcagggcc gggttcgccc tcagccgcta aacacgccga ccaagatcaa 
gcccggtcaa ggttgaagag cccccatatc agcaagggcc cggtgtcggc 



60 
120 
180 
181 



<210> 51 
<211> 108 
<212> PRT 



<213> Mycobacterium tuberculosis 



<;22Q?> 

<223> RV3861 



- hypothetical protein 



<400> 51 
Val Thr Trp Leu 
1 

Gin Ala Cys Lys 
20 

Ala Gin Arg Gly 
35 

Ser Arg Ala Val 
50 

Pro Leu Glu Pro 
65 

Arg His Pro Val 



Ser Gin Leu Ala 
100 




Ala Asp Pro Val 
5 

Thr Ser lie Ser 



Ala Gin Cys Gly 
40 

His lie Gin Gly 
55 

Ala Val Gin Ala 
70 

Val Ala His Arg 
85 

Gin Arg Glu Cys 



33 * 



Gly Asn Ser Arg 
10 

Ala Pro lie Val 
25 

Gin TVrg Glu Lys 



lie Ser Pro Pro 
60 

Ala Val Ala Ser 
75 

Val Thr Val Ala 
90 

Pro Arg Pro Ala 
105 




lie Ala Arg Ala 
15 

Glu Ser Trp Arg 
30 

Ser Cys Arg Cys 
45 

Leu Phe Arg Arg 



Cys Arg Leu Gly 
80 

Leu Gly Gin Gly 
95 



<210> 52 
<211> 116 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> WHIB6 - Possible transcriptional regulatory 
protein WHIB-like WHIB6 

<400> 52 



Met Arg Tyr 
1 


Ala 


Phe 
5 


Ala 


Ala 


Glu 


Ala 


Thr 
10 


Thr 


Cys 


Asn 


Ala 


Phe 
15 


Trp 


Arg Asn Val 


Asp 
20 


Met 


Thr 


Val 


Thr 


Ala 
25 


Leu 


Tyr 


Glu 


Val 


Pro 
30 


Leu 


Gly 


Val Cys Thr 
35 


Gin 


Asp 


Pro 


Asp 


Arg 
40 


Trp 


Thr 


Thr 


Thr 


Pro 
45 


Asp 


Asp 


Glu 


Ala Lys Thr 
50 


Leu 


Cys 


Arg 


Ala 
55 


Cys 


Pro 


Arg 


Arg 


Trp 
60 


Leu 


Cys 


Ala 


Arg 


Asp Ala Val 
65 


Glu 


Ser 


Ala 
70 


Gly 


Ala 


Glu 


Gly 


Leu 
75 


Trp 


Ala 


Gly 


Val 


Val 
80 


lie Pro Glu 


Ser 


Gly 
85 


Arg 


Ala 


Arg 


Ala 


Phe 
90 


Ala 


Leu 


Gly 


Gin 


Leu 
95 


Arg 


Ser Leu Ala 


Glu 
100 


Arg 


Asn 


Gly Tyr 


Pro 
105 


Val 


Arg 


Asp 


His 


Arg 
110 


Val 


Ser 


Ala Gin Ser 
115 


Ala 


























<210> 53 





























<212> PRT 




<213> Mycobacterium tuberculosis 
<220> 

<223> RV3 863 - hypothetical alanine rich protein 



<400> 53 



Met Ala 


Gly 


Glu 


Arg 
5 


Lys 


Val 


Cys 


Pro 


Pro 
10 


Ser 


Arg 


Leu 


Val 


Pro 
15 


Ala 


Asn Lys 


Gly 


Ser 
20 


Thr 


Gin 


Met 


Ser 


Lys 
25 


Ala Gly 


Ser 


Thr 


Val 
30 


Gly 


Pro 


Ala Pro 


Leu 
35 


Val 


Ala 


Cys 


Ser 


Gly 
40 


Gly 


Thr 


Ser 


Asp 


Val 
45 


He 


Glu 


Pro 


Arg Arg 

50 


Gly Val 


Ala 


He 


He 


Gly 


His 


Ser 


Cys 


Arg 
60 


Val 


Gly 


Thr 


Gin 


lie Asp 
65 


Asp 


Ser 


Arg 


He 
70 


Ser 


Gin 


Thr 


His 


Leu 
75 


Arg 


Ala 


Val 


Ser 


Asp 
80 


Asp Gly 


Arg 


Trp 


Arg 

O 3 


He 


Val 


Gly 


Asn 


He 
90 


Pro 


Arg 


Gly 


Met 


Phe 
95 


Val 


Gly Gly Arg Arg 
100 


Gly 


Ser 


Ser 


Val 


Thr 


Val 


Ser 


Asp 


Lys 


Thr 
110 


Leu 


He 




Gly 
115 


Asp 


Pro 


Pro 


Gly 


Gly 
120 


Lys 


Ala 


Leu 


Thr 


Phe 
125 


Glu 


Val 


Val 


Arg Pro 

130 


Ser 


Asp 


Ser 


Ala 


Ala 
i35 


Gin 


His 


Gly Arg 


Val 
140 


Gin 


Pro 


Ser 


Ala 


Asp Leu 
145 


Ser 


Asp 


Asp 


Pro 


Ala 


His 


Asn 


Ala 


Ala 
155 


Pro 


Val 


Ala 


Pro 


Asp 
160 


Pro Gly 


Val 


Val 


Arg 

± D 3 


Ala 


Gly Ala 


Ala 


Ala 
170 


Ala 


Ala 


Arg 


Arg 


Arg 
175 


Glu 


Leu Asp 


lie 


Ser 
180 


Gin 


Arg 


Ser 


Leu 


Ala 
185 


Ala 


Asp 


Gly 


He 


He 
190 


Asn 


Ala 


Gly Ala 


Leu 
195 


He 


Ala 


Phe 


Glu 


Lys 
200 


Gly Arg 


Ser 


Trp 


Pro 

205 


Arg 


Glu 


Arg 


Thr Arg 
210 


Ala 


Lys 


Leu 


Glu 


Glu 
215 


Val 


Leu 


Gin 


Trp 


Pro 
220 


Ala 


Gly 


Thr 


He 


Ala Arg 
225 


He 


Arg 


Arg 


Gly 
230 


Glu 


Pro 


Thr 


Glu 


Pro 
235 


Ala 


Thr 


Asn 


Pro 


Asp 
240 


Ala Ser 


Pro 


Gly 


Leu 
245 


Arg 


Pro 


Ala 


Asp 


Gly 
250 


Pro 


Ala 


Ser 


Leu 


He 
255 


Ala 


Gin Ala 


Val 


Thr 
260 


Ala 


Ala 


Val 


Asp 


Gly 
265 


Cys 


Ser 


Leu 


Ala 


He 
270 


Ala 


Ala 


Leu Pro 


Ala 
275 


Thr 


Glu 


Asp 


Pro 


Glu 
280 


Phe 


Thr 


Glu 


Arg 


Ala 
285 


Ala 


Pro 


He 


Leu Ala 


Asp 


Leu 


Arg 


Gin 


Leu 


Glu 


Ala 


He 


Ala 


Val 


Gin 


Ala 


Thr 


Arg 



290 

lie Ser Arg lie 
305 

Arg His His Asp 



Thr Leu Ala Gin 
340 

Thr Leu Glu Thr 
355 

Gly Ala Glu Ala 
370 

Glu Ala Leu lie 
385 



295 

Thr Pro Glu Leu 
310 

Glu Leu Met Arg 

325 . 

Arg Leu Tyr Ala 



Ala Gin Ala Ala 
360 

Glu Glu Glu Leu 
375 

Arg Gin lie Asn 
390 



300 

lie Lys Ala Leu 
315 

Leu Gly Ala Thr 
330 

Ala Arg Arg Arg 
345 

Gly Val Ala Glu 



Pro Ala Glu Ala 
380 



Gly Ala Val Arg 
320 

Ala Pro Gly Ala 

335 

Ala Asn Leu Ser 
350 

Glu Met lie Val 
365 

Thr Glu Ala lie 



<210> 54 
<211> 402 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3864 - conserved hypothetical protein 
.<400> 54 

Met Ala Ser Gly Ser Gly Leu Cys Lys Thr Thr Ser Asn Phe lie Trp 
1 5 10 15 

Gly Gin Leu Leu Leu Leu Gly Glu Gly lie Pro Asp Pro Gly Asp lie 
20 25 30 

Phe Asn Thr Gly Ser Ser Leu Phe Lys Gin lie Ser Asp Lys Met Gly 
35 40 45 

Leu Ala lie Pro Gly Thr Asn Trp lie Gly Gin Ala Ala Glu Ala Tyr 
50 55 60 

Leu Asn Gin Asn lie Ala Gin Gin Leu Arg Ala Gin Val Met Gly Asp 
65 70 75 80 



Leu Asp Lys Leu Thr Gly Asn Met lie Ser Asn Gin Ala Lys Tyr Val 
85 90 ' 95 

Ser Asp Thr Arg Asp Val Leu Arg Ala Met Lys Lys Met He Asp Gly 
100 105 110 

Val Tyr Lys Val Cys Lys Gly Leu Glu Lys He Pro Leu Leu Gly His 
115 120 125 

Leu Trp Ser Tirp Glu Leu Ala He Pro Met Ser Gly He Ala Met Ala 
130 135 140 

Val Val Gly Gly Ala Leu Leu Tyr Leu Thr He Met Thr Leu Met Asn 
145 150 155 160 



Ala Thr Asn Leu Arg Gly He Leu Gly Arg Leu He Glu Met Leu Thr 



36 



165 170 175 

Thr Leu Pro Lys Phe Pro Gly Leu Pro Gly Leu Pro Ser Leu Pro Asp 
180 185 190 

lie lie Asp Gly Leu Trp Pro Pro Lys Leu Pro Asp lie Pro lie Pro 
195 . 200 205 

Gly Leu Pro Asp lie Pro Gly Leu Pro Asp Phe Lys Trp Pro Pro Thr 
210 215 220 

Pro Gly Ser Pro Leu Phe Pro Asp Leu Pro Ser Phe Pro Gly Phe Pro 
225 230 235 240 

Gly Phe Pro Glu Phe Pro Ala lie Pro Gly Phe Pro Ala Leu Pro Gly 
245 250 255 

Leu Pro Ser lie Pro Asn Leu Phe Pro Gly Leu Pro Gly Leu Gly Asp 
260 265 270 

Leu Leu Pro Gly Val Gly Asp Leu Gly Lys Leu Pro Thr Trp Thr Glu 
275 280 285 

Leu Ala Ala Leu Pro Asp . Phe Leu Gly Gly Phe Ala Gly Leu Pro Ser 
290 295 300 

Leu Gly Phe Gly Asn Leu Leu Ser Phe Ala Ser Leu Pro Thr Val Gly 
305 310 315 320 

Gin Val Thr Ala Thr Met Gly Gin Leu Gin Gin Leu Val Ala Ala Gly 
325 330 335 

Gly Gly Pro Ser Gin Leu Ala Ser Met Gly Ser Gin Gin Ala Gin Leu 
340 345 350 

lie Ser Ser Gin Ala Gin Gin Gly Gly Gin Gin His Ala Thr Leu Val 
355 360 365 

Ser Asp Lys Lys Glu Asp Glu Glu Gly Val Ala Glu Ala Glu Arg Ala 
370 375 380 

Pro lie Asp Ala Gly Thr Ala Ala Ser Gin Arg Gly Gin Glu Gly Thr 
385 390 395 400 

Val Leu — 



55 
103 
PRT 

Mycobacterium tuberculosis 
<220> 

<223> RV3865 - conserved hypothetical protein 
<400> 55 

Met Thr Gly Phe Leu Gly Val Val Pro Ser Phe Leu Lys Val Leu Ala 
1 5 10 15 



<210> 
<211> 
<212> 
<213> 



Gly Met His Asn Glu lie Val Gly Asp lie Lys Arg Ala Thr Asp Thr 



20 25 30 

Val Ala Gly lie Ser Gly Arg Val Gin Leu Thr His Gly Ser Phe Thr 
35 40 45 

Ser Lys Phe Asn Asp Thr Leu Gin Glu Phe Glu Thr Thr Arg Ser Ser 

• 50 55 ■ 60 , 

Thr Gly Thr Gly Leu Gin Gly Val Thr Ser Gly Leu Ala Asn Asn Leu 
65 70 75 80 

Leu Ala Ala Ala Gly Ala Tyr Leu Lys Ala Asp Asp Gly Leu Ala Gly 
85 90 95 

Val He Asp Lys lie Phe Gly 
100 



<210> 56 
<211> 283 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3 866 - conserved hypothetical protein 
<400> 56 



Met 
1 


Thr 


Gly 


Pro 


Ser 
5 


Ala 


Ala 


Gly 


Arg 


Ala 
10 


Gly 


Thr 


Ala 


Asp 


Asn 
15 


Val 


Val 


Gly 


Val 


Glu 
20 


Val 


Thr 


He 


Asp 


Gly 
25 


Met 


Leu 


Val 


He 


Ala 
30 


Asp 


Arg 


Leu 


His 


Leu 
35 


Val 


Asp 


Phe 


Pro 


Val 
40 


Thr 


Leu 


Gly 


He 


Arg 
45 


Pro' 


Asn 


He 


Pro 


Gin 
50 


Glu 


Asp 


Leu 


Arg 


Asp 
55 


lie 


Val 


Trp 


Glu 


Gin 
60 


Val 


Gin 


Arg 


Asp 


Leu 
65 


Thr 


Ala 


Gin 


Gly 


Val 
70 


Leu 


Asp 


Leu 


His 


Gly 
75 


Glu 


Pro 


Gin 


Pro 


Thr 
80 


Val 


Ala 


Glu 


Met 


Val 
85 


Glu 


Thr 


Leu 


Gly 


Arg 
90 


Pro 


Asp 


Arg 


Thr 


Leu 
95 


Glu 


Gly 


Arg 


Trp 


Trp 
100 


Arg 


Arg 


Asp 


He 


Gly 
105 


Gly 


Val 


Met 


Val 


Arg 
110 


Phe 


Val 


Val 


Cys 


Arg 
115 


Arg 


Gly 


Asp 


Arg 


His 
120 


Val 


He 


Ala 


Ala 


Arg 
125 


Asp 


Gly 


Asp 


Met 


Leu 
130 


Val 


Leu 


Gin 


Leu 


Val 
135 


Ala 


Pro 


Gin 


Val 


Gly 
140 


Leu 


Ala 


Gly 


Met 


Val 
145 


Thr 


Ala 


Val 


Leu 


Gly 
150 


Pro 


Ala 


Glu 


Pro 


Ala 
155 


Asn 


Val 


Glu 


Pro 


Leu 
160 


Thr 


Gly 


Val 


Ala 


Thr 
165 


Glu 


Leu 


Ala 


Glu 


Cys 
170 


Thr 


Thr 


Ala 


Ser 


Gin 
175 


Leu 



Thr Gin Tyr Gly He Ala Pro Ala Ser Ala Arg Val Tyr Ala Glu He 



38 



- ISO 185 190 

Val Gly Asn Pro Thr Gly Trp Val Glu lie Val Ala Ser Gin Arg His 
195 200 205 

Pro Gly Gly Thr Thr Thr Gin Thr Asp Ala Ala Aia Gly Val Leu Asp 
210 215 220 

Ser Lys Leu Gly Arg Leu Val Ser Leu Pro Arg Arg Val Gly Gly Asp 
225 230 235 240 

Leu Tyr Gly Ser Phe Leu Pro Gly Thr Gin Gin Asn Leu Glu Arg Ala 
245 250 255 

Leu Asp Gly Leu Leu Glu Leu Leu Pro Ala Gly Ala Trp Leu Asp His 
260 265 270 

Thr Ser Asp His Ala. Gin Ala Ser Ser Arg Gly 
275 280 



<210> 57 
<211> 183 
<212> PRT 

<213> mycobacterium tuberculosis 
<220> 

<2 23> Protein sequence Rv3 867 
<400> 57 

Met Val Asp Pro Pro Gly Asn Asp Asp Asp His Gly Asp Leu Asp Ala 
1 5 10 .15 

Leu Asp Phe Ser Ala Ala His Thr Asn Glu Ala Ser Pro Leu Asp Ala 
20 25 30 . 

Leu Asp Asp Tyr Ala Pro Val Gin Thr Asp Asp Ala Glu Gly Asp Leu 
35 40 45 

Asp Ala Leu His Ala Leu Thr Glu Arg Asp Glu Glu Pro Glu Leu Glu 
50 55 60 

Leu Phe Thr Val Thr Asn Pro Gin Gly Ser Val Ser Val Ser Thr Leu 
SS XO 2.5 80 — 



Met Asp Gly Arg lie Gin His Val Glu Leu Thr Asp Lys Ala Thr Ser 
85 90 95 

Met Ser Glu Ala Gin Leu Ala Asp Glu He Phe Val He Ala Asp Leu 
100 105 110 

Ala Arg Gin Lys Ala Arg Ala Ser Gin Tyr Thr Phe Met Val Glu Asn 
115 120 125 

He Gly Glu Leu Thr Asp Glu Asp Ala Glu Gly Ser Ala Leu Leu Arg 
130 135 140 



Glu Phe Val Gly Met Thr Leu Asn Leu Pro Thr Pro Glu Glu Ala Ala 
1 1 1 : 1 60 



• # 

Ala Ala Glu Ala Glu Val Phe Ala Thr Arg Tyr Asp Val Asp Tyr Thr 
165 170 175 

Ser Arg Tyr Lys Ala Asp Asp 
180 

<210> 58 • . ' ■ 

<211> 573 
<212> PRT 

<213> mycobacterium tuberculosis 



<220> 

<223> Protein sequence Rv3868 
<400> 58 

Met Thr Asp Arg Leu Ala Ser Leu 
1 5 



Phe Glu Ser Ala Val Ser Met Leu 
10 15 



Pro Met Ser Glu 
20 

Tyr Asp Glu Ser 
35 

Asp Thr Asp Arg 
50 

Phe Gly Gin Leu 
65 

Ala Arg lie Ala 



Ala Arg Ser Leu 



Ala Cys Asp Ala 
40 

Val Thr Leu Phe 
55 

Ser Gly Ser Val 
70 

lie Gly Gly Leu 
85 



Asp Leu Phe Thr 
25 

Trp lie Gly Arg 



Arg Ala Trp Tyr 
60 

Gin lie Ser Met 
75 

Tyr Gly Asp lie 
90 



Glu lie Thr Asn 
30 

lie Arg Cys Gly 
45 

Ser Arg Arg Asn 



Ser Thr Leu Asn 
80 

Thr Tyr Pro Val 
95 



Thr Ser Pro Leu 
100 

Gin Gly Asn Tyr 
115 



Ala lie Thr Met 

Ala Asp Ala Met 
120 



Gly Phe Ala Ala 
105 

Glu Ala Leu Glu 



Cys Glu Ala Ala 
110 

Ala Ala Pro Val 
125 



Ala Gly Ser Glu His Leu Val Ala Trp Met Lys Ala Val Val Tyr Gly 
130 135 140 



Ala Ala Glu Arg Trp Thr Asp Val lie Asp Gin Val Lys Ser Ala Gly 
145 150 155 160 



Lys Trp Pro Asp 



Val Ala Ala Ala 
180 

Thr Glu Ala Asn 
195 

Ala Trp Tyr Leu 
210 

Ala Val Ala Leu 
225 



Lys Phe Leu Ala 
165 

Asn Leu Ala Leu 



Asp Ser Pro Ala 
200 

Ala Met Ala Arg 
215 

Leu Glu Trp Leu 
230 



Gly Ala Ala Gly 
170 

Phe Thr Glu Ala 
185 

Gly Glu Ala Cys 



Ai-g Ser Gin Gly 

220 

Gin Thr Thr His 
235 



Val Ala His Gly 
175 

Glu Arg Arg Leu 
190 

Ala Arg Ala lie 
205 

Asn Glu Ser Ala 



Pro Glu Pro Lys 
240 



Val Ala Ala Ala Leu Lys Asp Pro Ser Tyr Arg Leu Lys Thr Thr Thr 



245 250 255 

Ala Glu Gin lie Ala Ser Arg Ala Asp Pro Trp Asp Pro Gly Ser Val 
260 265 , 270 

Val Thr Asp Asn. Ser Gly Arg Glu Arg Leu Leu Ala Glu Ala Gin Ala 
275 .280 285 

Glu Leu Asp Arg Gin lie Gly Leu Thr Arg Val Lys Asn Gin lie Glu 
290 295 300 

Arg Tyr Arg Ala Ala Thr Leu Met Ala Arg Val Arg Ala Ala Lys Gly 
305 310 315 320 

Met Lys Val Ala Gin Pro Ser Lys His Met He Phe Thr Gly Pro Pro 
325 330 335 

Gly Thr Gly Lys Thr Thr He Ala Arg Val Val Ala Asn He Leu Ala 
340 345 350 

Gly Leu Gly Val lie Ala Glu Pro Lys Leu Val Glu Thr Ser Arg Lys 
355 360 365 

Asp Phe Val Ala Glu Tyr Glu Gly Gin Ser Ala Val Lys Thr Ala Lys 
370 375 380 

Thr He Asp Gin Ala Leu Gly Gly Val Leu Phe He Asp Glu Ala Tyr 
385 390 395 400 

Ala Leu Val Gin Glu Arg Asp Gly Arg Thr Asp Pro Phe Gly Gin Glu 
405 410 415 

Ala Leu Asp Thr Leu Leu Ala Arg Met Glu Asn Asp Arg Asp Arg Leu 
420 425 430 

Val Val He He Ala Gly Tyr Ser Ser Asp He Asp Arg Leu Leu Glu 
435 440 445 

Thr Asn Glu Gly Leu Arg Ser Arg Phe Ala Thr Arg He Glu Phe Asp 
450 455 460 

Thr Tyr Ser Pro Glu Glu Leu Leu Glu He Ala Asn Val He Ala Ala 
465 470 475 480 

A la A sp A sp Ser A1 a Tieu Thr Ala Olii Ala Ala Glii Asn Php Leu Gin — 
485 490 495 

Ala Ala Lys Gin Leu Glu Gin Arg Met Leu Arg Gly Arg Arg Ala Leu 
500 505 510 

Asp Val Ala Gly Asn Gly Arg Tyr Ala Arg Gin Leu Val Glu Ala Ser 
515 520 525 

Glu Gin Cys Arg Asp Met Arg Leu Ala Gin Val Leu Asp He Asp Thr 
530 535 540 

Leu Asp Glu Asp Arg Leu Arg Glu He Asn Gly Ser Asp Met Ala Glu 
545 550 555 560 



Ala T1<=> Ala Ala Val His Ala Wi s T.mi Asn Met Arg Glu 
565 570 
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<210> 59 
<211> 480 
<212> PRT 

<213> tnycobacterium tuberculosis 

<220> 

<22 3> Protein sequence Rv3 869 
<400> 59 

Met Gly Leu Arg Leu Thr Thr Lys Val Gin Val Ser Gly Trp Arg Phe 
1 5 10 15 



Leu 


Leu 


Arg 


Arg 
^ yj 


Leu 


Glu 


His 


Ala 


He 
25 


Val 


Arg 


Arg 


Asp 


Thr 
30 


Arg 


Met 


Phe 


Asp 


Asp 

J D 


Pro 


Leu 


Gin 


Phe 


Tyr 
40 


Ser 


Arg 


Ser . 


He 


Ala 


Leu 


Gly 


He 


Val 


Val 
50 


Ala 


Val 


Leu 


He 


Leu 
55 


Ala 


Gly Ala 


Ala 


Leu 
60 


Leu 


Ala 


Tyr 


Phe 


Lys 

03 


Pro 


Gin 


Gly 


Lys . 


Leu 


Gly 


Gly 


Thr 


Ser 


Leu 

/ 3 


Phe 


Thr 


Asp 


Arg 


Ala 
80 


Thr 


Asn 


Gin 


Leu 


Tyr 

O D 


Val 


Leu 


Leu 


Ser 


Gly 
90 


Gin 


Leu 


His 


Pro 


Val 


Tyr 


Asn 


Leu 


Thr 


Ser 

inn 

JL u u 


Ala 


Arg 


Leu 


Val 


Leu 
105 


Gly 


Asn 


Pro 


Ala 


Asn 
110 


Pro 


Ala 


Thr 


Val 


Lys 

T 1 ^ 
J. X 3 


Ser 


Ser 


Glu 


Leu 


Ser Lys 
120 


Leu 


Pro 


Met 


Gly 
125 


Gin 


Thr 


Val 


Gly 


He 
130 


Pro 


Gly 


Ala 


Pro 


Tyr 
135 


Ala 


Thr 


Pro 


Val 


Ser 
140 


Ala 


Gly 


Ser 


Thr 


Ser 
145 


He 


Trp 


Thr 


Leu 


Cys 
150 


Asp 


Thr 


Val 


Ala 


Arg 
155 


Ala 


Asp 


Ser 


Thr 


Ser 
160 


Pro 


Val 


Val 


Gin 


Thr 
165 


Ala 


Val 


He 


Ala 


Met 
170 


Pro 


Leu 


Glu 


He 


Asp 
175 


Ala 


Ser 


He 


Asp 


Pro 
180 


Leu 


Gin 


Ser 


His 


Glu 
185 


Ala 


Val 


Leu 


Val 


Ser 
190 


Tyr 


Gin 


Gly 


Glu 


Thr 
195 


Trp 


He 


Val 


Thr 


Thr 
200 


Lys 


Gly 


Arg 


His 


Ala 
205 


He 


Asp 


Leu 


Thr 


Asp 
210 


Arg 


Ala 


Leu 


Thr 


Ser 
215 


Ser 


Met 


Gly 


He 


Pro 
220 


Val 


Thr 


Ala 


Arg 


Pro 
225 


Thr 


Pro 


He 


Ser 


Glu 
230 


Gly 


Met 


Phe 


Asn 


Ala 
235 


Leu 


Pro 


Asp 


Met 


Gly 
240 


Pro 


Trp 


Gin 


Leu 


Pro 
245 


Pro 


He 


Pro 


Ala 


Ala 
250 


Gly 


Ala 


Pro 


Asn 


Ser 
255 


Leu 



Gly Leu Pro Asp Asp Leu Val He Gly Ser Val Phe Gin He His Thr 
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260 265 270 

Asp Lys Gly Pro Gin Tyr Tyr Val Val Leu Pro Asp Gly lie Ala Gin 
275 280 285 

Val Asn Ala Thr Thr Ala Ala Ala Leu TVrg Ala Thr Gin Ala His Gly 

290 . 295 300 

Leu Val Ala Pro Pro Ala Met Val Pro Ser Leu Val Val Arg lie Ala 
305 310 315 320 

Glu Arg Val Tyr Pro Ser Pro Leu Pro Asp Glu Pro Leu Lys lie Val 
325 330 335 

Ser Arg Pro Gin Asp Pro Ala Leu Cys Trp Ser Trp Gin Arg Ser Ala 
340 345 350 

Gly Asp Gin Ser Pro Gin Ser Thr Val Leu Ser Gly Arg His Leu Pro 
355 360 365 

lie Ser Pro Ser Ala Met Asn Met Gly lie Lys Gin lie His Gly Thr 
370 375 380 

Ala Thr Val Tyr Leu Asp Gly Gly Lys .Phe Val Ala Leu Gin Ser Pro 
385 390 395 400 

Asp Pro Arg Tyr Thr Glu Ser Met Tyr Tyr lie Asp Pro Gin Gly Val 
405 410 415 

Arg Tyr Gly Val Pro Asn Ala Glu Thr Ala Lys Ser Leu Gly Leu Ser 
420 425 430 

Ser Pro Gin Asn Ala Pro Trp Glu lie Val Arg Leu Leu Val Asp Gly 
435 440 445 

Pro Val Leu Ser Lys Asp Ala Ala Leu Leu Glu His Asp Thr Leu Pro 
450 455 460 

Ala Asp Pro Ser Pro Arg Lys Val Pro Ala Gly Ala Ser Gly Ala Pro 
465 470 475 480 



<210> 60 
.<211> 747 

<212> PRT 

<213> mycobacterium tuberculosis 

<220> 

<22 3> Protein sequence Rv3870 

<400> 60 

Met Thr Thr Lys Lys Phe Thr Pro Thr lie Thr Arg Gly Pro Arg Leu 
15 10 15 

Thr Pro Gly Glu lie Ser Leu Thr Pro Pro Asp Asp Leu Gly lie Asp 
20 25 30 . 



He Pro Pro Ser Gly Val Gin Lys He Leu Pro Tyr Val Met Gly Gly 
35 40 45 
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Ala Met Leu Gly Met lie Ala lie Met Val Ala Gly Gly Thr Arg Gin 
50 55 60 

Leu Ser Pro Tyr Met Leu Met Met Pro Leu Met Met lie Val Met Met 
65 70 75 80 

Val Gly Gly Leu Ala Gly Ser Thr Gly Gly Gly Gly Lys Lys yal Pro 
85 90 95 

Glu lie Asn Ala Asp Arg Lys Glu Tyr Leu Arg Tyr Leu Ala Gly Leu 
100 105 110 

Arg Thr Arg Val Thr Ser Ser Ala Thr Ser Gin Val Ala Phe Phe Ser 
115 120 125 

Tyr His Ala Pro His Pro Glu Asp Leu Leu Ser lie Val Gly Thr Gin 
130 135 140 

Arg Gin Trp Ser Arg Pro Ala Asn Ala Asp Phe Tyr Ala Ala Thr Arg 
145 150 155 160 

lie Gly lie Gly Asp Gin Pro Ala Val Asp Arg Leu Leu Lys Pro Ala 
165 170 175 

Val Gly Gly Glu Leu Ala Ala Ala Ser Ala Ala Pro Gin Pro Phe Leu 
180 185 .190 

Glu Pro Val Ser His Met Trp Val Val Lys Phe Leu Arg Thr His Gly 
195 200 205 

Leu lie His Asp Cys Pro Lys Leu Leu Gin Leu Arg Thr Phe Pro Thr 
210 215 220 

lie Ala lie Gly Gly Asp Leu Ala Gly Ala Ala Gly Leu Met Thr Ala 
225 230 235 240 

Met lie Cys His Leu Ala Val Phe His Pro Pro Asp Leu Leu Gin lie 
245 250 255 

Arg Val Leu Thr Glu Glu Pro Asp Asp Pro Asp Trp Ser Trp Leu Lys 
260 265 270 

Trp Leu Pro His Val Gin His Gin Thr Glu Thr Asp Ala Ala Gly Ser 
275 280 285 



Thr Arg Leu lie Phe Thr Arg Gin Glu Gly Leu Ser Asp Leu Ala Ala 
290 295 300 

Arg Gly Pro His Ala Pro Asp Ser Leu Pro Gly Gly Pro Tyr Val Val 

305 310 315 320 

Val Val Asp Leu Thr Gly Gly Lys Ala Gly Phe Pro Pro Asp Gly Arg 
325 330 335 

Ala Gly Val Thr Val lie Thr Leu Gly Asn His Arg Gly Ser Ala Tyr 
340 345 ■ 350 

Arg lie Arg Val His Glu Asp Gly Thr Ala Asp Asp Arg Leu Pro Asn 
355 360 365 



Gin Ser Phe Arg Gin Val Thr Ser Val Thr Asp Arg Met Ser Pro Gin 
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370 375 380 • ' 

Gin Ala Ser Arg lie Ala Arg Lys Leu Ala Gly Trp Ser lie Thr Gly 
385 390 395 - 400 

Thr lie Leu Asp Lys Thr Ser Arg Val Gin Lys* Lys Val Ala Thr Asp. 

405 - 410 . * . * 415 

Trp His Gin Leu Val Gly Ala Gin Ser Val Glu Glu lie Thr Pro Ser 
420 425 430 

Arg Trp Arg Met Tyr Thr Asp Thr Asp Arg Asp Arg Leu Lys lie Pro 
435 440 445 

Phe Gly His Glu Leu Lys Thr Gly Asn Val Met Tyr Leu Asp lie Lys 
450 455 460 

Glu Gly Ala Glu Phe Gly Ala Gly Pro His Gly Met Leu lie Gly Thr 
465 470 475 480 

Thr Gly Ser Gly Lys Ser Glu Phe Leu Arg Thr Leu lie Leu Ser Leu 
485 490 495 

Val Ala Met Thr His Pro Asp Gin Val Asn Leu Leu Leu Thr Asp Phe 
500 505 510 

Lys Gly Gly Ser Thr Phe Leu Gly Met Glu Lys Leu Pro His Thr Ala 
515 520 525 

Ala Val Val Thr Asn Met Ala Glu Glu Ala Glu Leu Val Ser Arg Met 
530 535 540 

Gly Glu Val Leu Thr Gly Glu Leu Asp Arg Arg Gin Ser lie Leu Arg 
545 550 555 560 

Gin Ala Gly Met Lys Val Gly Ala Ala Gly Ala Leu Ser Gly Val Ala 
565 570 575 

Glu Tyr Glu Lys Tyr Arg Glu Arg Gly Ala Asp Leu Pro Pro Leu Pro 
580 585 590 

Thr Leu Phe Val Val Val Asp Glu Phe Ala Glu Leu Leu Gin Ser His 
595 600 605 

Pro Asp Phe lie G l y L eu Phe A s p Arg lie Cys Arg Val Gly Arg Ser 



610 615 620 

Leu Arg Val His Leu Leu Leu Ala Thr Gin Ser Leu Gin Thr Gly Gly 
625 630 635 640 

Val Arg lie Asp Lys Leu Glu Pro Asn Leu Thr Tyr Arg lie Ala Leu 
645 650 655 

Arg Thr Thr Ser Ser His Glu Ser Lys Ala Val lie Gly Thr Pro Glu 
660 665 670 

Ala Gin Tyr lie Thr Asn Lys Glu Ser Gly Val Gly Phe Leu Arg Val 
675 680 685 

Gly MR^ nin Asp Pto Val Tiys Phft Ser Thr Phe Tyr Tip fler Gly Pro 

690 695 700 




Tyr Met Pro Pro Ala Ala Gly Val Glu Thr Asn Gly Glu Ala Gly Gly 
705 710 715 720 

Pro Gly Gin Gin Thr Thr Arg Gin Ala Ala Arg lie His Arg Phe Thr 
725 • 730- • - 73*5 

Ala Ala Pro Val Leu Glu Glu Ala Pro Thr Pro 
740 745 



<210> 61 
<211> 591 
<212> PRT 

<213> mycobacterium tuberculosis 
<220> 

<223> Protein sequence Rv3871 
<400> 61 

Met Thr Ala Glu Pro Glu Val Arg Thr Leu Arg Glu Val Val Leu Asp 
15 10 15 

Gin Leu Gly Thr Ala Glu Ser Arg Ala Tyr Lys Met Trp Leu Pro Pro 
20 25 30 

Leu Thr Asn Pro Val Pro Leu Asn Glu Leu lie Ala Arg Asp Arg Arg 
35 40 45 

Gin Pro Leu Arg Phe Ala Leu Gly lie Met Asp Glu Pro Arg Arg His 
'50 55 60 

Leu Gin Asp Val Trp Gly Val Asp Val Ser Gly Ala Gly Gly Asn lie 
65 70 75 80 

Gly lie Gly Gly Ala Pro Gin Thr Gly Lys Ser Thr Leu Leu Gin Thr 
85 90 - 95 

Met Val Met Ser Ala Ala Ala Thr His Ser Pro Arg Asn Val Gin Phe 
100 105 110 

Tyr Cys lie Asp Leu Gly Gly Gly Gly Leu lie Tyr Leu Glu Asn Leu 
115 120 125 



Pro His Val Gly Gly Val Ala Asn Arg Ser Glu Pro Asp Lys Val Asn 
130 135 140 

Arg Val Val Ala Glu Met Gin Ala Val Met Arg Gin Arg Glu Thr Thr 
145 150 155 160 

Phe Lys Glu His Arg Val Gly Ser lie Gly Met Tyr Arg Gin Leu Arg 
165 170 175 

Asp Asp Pro Ser Gin Pro Val Ala Ser Asp Pro Tyr Gly Asp Val Phe 
180 185 190 

Leu lie lie Asp Gly Trp Pro Gly Phe Val Gly Glu Phe Pro Asp Leu 
195 200 205 



Glu Gly Gin Val Gin Asp Leu Ala Ala Gin Gly Leu Ala Phe Gly Val 
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210 215 220 

His Val lie lie Ser Thr Pro Arg Trp Thr Glu Leu Lys Ser Arg Val 
225 230 235 . 240 

Arg Asp Tyr Leu Gly Thr Lys lie Glu Phe Arg Leu Gly Asp Val Asn 

245 250 - 255 

Glu Thr Gin lie Asp Arg lie Thr Arg Glu lie Pro Ala Asn Arg Pro 
260 265 270 

Gly Arg Ala Val Ser Met Glu Lys His His Leu Met He Gly Val -Pro 
275 280 285 

Arg Phe Asp Gly Val His Ser Ala Asp Asn Leu Val Glu Ala He Thr 
290 295 300 

Ala Gly Val Thr Gin He Ala Ser Gin His Thr Glu Gin Ala Pro Pro 
305 310 315 320 

Val Arg Val Leu Pro Glu Arg He His Leu His Glu Leu Asp Pro Asn 
325 330 335 

Pro Pro Gly Pro Glu Ser Asp Tyr Arg Thr Arg Trp Glu He Pro He 
340 345 350 

Gly Leu Arg Glu Thr Asp Leu Thr Pro Ala His Cys His Met His Thr 
355 360 365 

Asn Pro His Leu Leu He Phe Gly Ala Ala Lys Ser Gly Lys Thr Thr 
370 375 380 

He Ala His Ala He Ala Arg Ala He Cys Ala Arg Asn Ser Pro Gin 
385 390 395 400 

Gin Val Arg Phe Met Leu Ala Asp Tyr -Arg Ser Gly . Leu Leu Asp Ala 
405 410 415 

Val Pro Asp Thr His Leu Leu Gly Ala Gly Ala He Asn Arg Asn Ser 
420 425 430 

Ala Ser Leu Asp Glu Ala Val Gin Ala Leu Ala Val Asn Leu Lys Lys 
435 440 445 

Arg Leu Pro Pro Thr Asp Leu Thr Thr Ala Gin Leu Arg Ser Arg Ser 



450 455 460 

Trp Trp Ser Gly Phe Asp Val Val Leu Leu Val Asp Asp Trp His Met 
465 470 475 480 

He Val Gly Ala Ala Gly Gly Met Pro Pro Met Ala Pro Leu Ala Pro 
485 490 495 

Leu Leu Pro Ala Ala Ala Asp He Gly Leu His He He Val Thr Cys 
500 505 510 

Gin Met Ser Gin Ala Tyr Lys Ala Thr Met Asp Lys Phe Val Gly Ala 
515 520 525 

Ala Phe Glv Ser Glv Ala Pro Thr Met Phe Leu Ser Gly Glu Lys Gin 
530 535 540 
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Glu Phe Pro Ser Ser Glu Phe Lys Val Lys Arg Arg Pro Pro Gly Gin 
545 550 555 560 

Ala Phe Leu Val Ser Pro Asp Gly Lys Glu Val lie Gin Ala Pro Tyr 
565 570 575 - 

lie Glu Pro Pro Glu Glu Val Phe Ala Ala Pro Pro Ser Ala Gly 
580 585 590 



<210> 62 
<211> 99 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> RV3872-PE35 - PE family- re la ted protein 
<400> 62 

Met Glu Lys Met Ser His Asp Pro lie Ala Ala Asp lie Gly Thr Gin 
15 10 15 

Val Ser Asp Asn Ala Leu His Gly Val Thr Ala Gly Ser Thr Ala Leu 
20 25 30 

Thr Ser Val Thr Gly Leu Val Pro Ala Gly Ala Asp Glu Val Ser Ala 
35 40 45 

Gin Ala Ala Thr Ala Phe Thr Ser Glu Gly lie Gin Leu Leu Ala Ser 
50 55 60 

Asn Ala Ser Ala Gin Asp Gin Leu His Arg Ala Gly Glu Ala Val Gin 
65 70 75 80 

Asp Val Ala Arg Thr Tyr Ser Gin lie Asp Asp Gly Ala Ala Gly Val 
85 90 95 

Phe Ala Glu 



<210> 63 
<211>^ 368 

<212> PRT 

<213> Mycobacterium tuberculosis 

<220> 

<223> Rv3873-PPE68 - PPE family protein 
<400> 63 

Met Leu Trp His Ala Met Pro Pro Glu Leu Asn Thr Ala Arg Leu Met 
1 5 10 15. 

Ala Gly Ala Gly Pro Ala Pro Met Leu Ala Ala Ala Ala Gly Trp Gin 
20 25 30 

Thr Leu Ser Ala Ala Leu Asp Ala Gin Ala Val Glu Leu Thr Ala Arg 
35 40 45 



Leu Asn Ser Leu Gly Glu Ala Trp Thr Gly Gly Gly Ser Asp Lys Ala 
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50 55 60 

Leu Ala Ala Ala Thr Pro Met Val Val Trp Leu Gin Thr Ala Ser Thr 
65 70 75 80 

Gin Ala Lys Thr Arg Ala Met Gin Ala Thr Ala Gin Ala Ala Ala Tyr 

85 . - . 90 55 . 

Thr Gin Ala Met Ala Thr Thr Pro Ser Leu Pro Glu lie Ala Ala Asn 
100 105 110 

His lie Thr Gin Ala Val Leu Thr Ala Thr Asn Phe Phe Gly lie Asn 
115 120 125 

Thr lie Pro lie Ala Leu Thr Glu Met Asp Tyr Phe lie Arg Met Trp 
130 135 140 

Asn Gin Ala Ala Leu Ala Met Glu Val Tyr Gin Ala Glu Thr Ala Val 
145 150 155 160 

Asn Thr Leu Phe Glu Lys Leu Glu Pro Met Ala Ser lie Leu Asp Pro 
165 170 175 

Gly Ala Ser Gin Ser Thr Thr Asn Pro lie Phe Gly Met Pro Ser Pro 
180 185 190 

Gly Ser Ser Thr Pro Val Gly Gin Leu Pro Pro Ala Ala Thr Gin Thr 
195 200 205 

Leu Gly Gin Leu Gly Glu Met Ser Gly Pro Met Gin Gin Leu Thr Gin 
210 215 220 

Pro Leu Gin Gin Val Thr Ser Leu Phe Ser Gin Val Gly Gly Thr Gly 
225 230 235 240 

Gly Gly Asn Pro Ala Asp Glu Glu Ala Ala Gin Met Gly Leu Leu Gly 
245 250 255 

Thr Ser Pro Leu Ser Asn His Pro Leu Ala Gly Gly Ser Gly Pro Ser 
260 265 270 

Ala Gly Ala Gly Leu Leu Arg Ala Glu Ser Leu Pro Gly Ala Gly Gly 
275 280 285 

Ser Leu Thr Arg Thr Pro Leu Met Ser Gin Leu 1 1 niij Lys P-rn Val 
290 295 300 

Ala Pro Ser Val Met Pro Ala Ala Ala Ala Gly Ser Ser Ala Thr Gly 
305 310 315 320 

Gly Ala Ala Pro Val Gly Ala Gly Ala Met Gly Gin Gly Ala Gin Ser 
325 330 335 

Gly Gly Ser Thr Arg Pro Gly Leu *Val Ala Pro Ala Pro Leu Ala* Gin 
340 345 350 



Glu Arg Glu Glu Asp Asp Glu Asp Asp Trp Asp Glu Glu Asp Asp Trp 
355 ' 360 365 



<210> 64 
<211> 100 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3874-esxB - 10)cDa culture filtrate antigen CFPIO 
<400> 64 

Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Ala Gin Glu Ala Gly 
15 10 15 

Asn Phe Glu Arg lie Ser Gly Asp Leu Lys Thr Gin lie Asp Gin Val 
20 25 30 

Glu Ser Thr Ala Gly Ser Leu Gin Gly Gin Trp Arg Gly Ala Ala Gly 
35 40 45 

Thr Ala Ala Gin Ala Ala Val Val Arg Phe Gin Glu Ala Ala Asn Lys 
50 55 60 

Gin Lys Gin Glu Leu Asp Glu lie Ser Thr Asn lie Arg Gin Ala Gly 
65 70 75 80 

Val Gin Tyr Ser Arg Ala Asp Glu Glu Gin Gin Gin Ala Leu Ser Ser 
85 90 95 

Gin Met Gly Phe 
100 



<210> 65 
<211> 95 
<212> PRT 

<213> Mycobacterium tuberculosis 

<220> 

<223> Rv3875-Esat6 - 6 kDa early secretory antigenic 
target Esat6 (Esat-6) 

<400> 65 

Met Thr Glu Gin Gin Trp Asn Phe Ala Gly He Glu Ala Ala Ala Ser 
15 10 15 



Ala He Gin Gly Asn Val Thr Ser He His Ser Leu Leu Asp Glu Gly 
20 25 30 

Lys Gin Ser Leu Thr Lys Leu Ala Ala Ala Trp Gly Gly Ser Gly Ser 
35 40 45 

Glu Ala Tyr Gin Gly Val Gin Gin Lys Trp Asp Ala Thr Ala Thr Glu 
50 55 60 

Leu Asn Asn Ala Leu Gin Asn Leu Ala Arg Thr He Ser Glu Ala Gly 
65 70 75 80 

Gin Ala Met Ala Ser Thr Glu Gly Asn Val Thr Gly Met Phe Ala 
85 90 95 
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<210> 66 
<211> 666 
<212> PRT 

<213> mycobacterium tuberculosis 
<220> 

. <223> Protein sequence Rv3876 
<400> 66 

Met Ala Ala Asp Tyr Asp Lys Leu Phe Arg Pro His Glu Gly Met Glu 
15 10 15 

Ala Pro Asp Asp Met Ala Ala Gin Pro Phe Phe Asp Pro Ser Ala Ser 
20 25 30 

Phe Pro Pro Ala Pro Ala Ser Ala Asn Leu Pro Lys Pro Asn Gly Gin 
35 40 45 

Thr Pro Pro Pro Thr Ser Asp Asp Leu Ser Glu Arg Phe Val Ser Ala 
50 55 60 

Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Thr Pro Met 
65 70 75 80 

Pro lie Ala Ala Gly Glu Pro Pro Ser Pro Glu Pro Ala Ala Ser Lys 
85 90 95 

Pro Pro Thr Pro Pro Met Pro lie Ala Gly Pro Glu Pro Ala Pro Pro 
100 105 110 

Lys Pro Pro Thr Pro Pro Met Pro lie Ala Gly Pro Glu Pro Ala Pro 
115 120 125 

Pro Lys Pro Pro Thr Pro Pro Met Pro lie Ala Gly Pro Ala Pro Thr 
130 135 140 

Pro Thr Glu iser Gin Leu Ala Pro Pro Arg Pro Pro Thr Pro Gin Thr 

145 150 155 160 

Pro Thr Gly Ala Pro Gin Gin Pro Glu Ser Pro Ala Pro His Val Pro 
165 170 175 

Ser His Gly Pro His Gin Pro Arg Arg Thr Ala Pro Ala Pro Pro Trp 
180 IRS 1^ 



Ala Lys Met Pro lie Gly Glu Pro Pro Pro Ala Pro Ser Arg Pro Ser 
195 200 205 

Ala Ser Pro Ala Glu Pro Pro Thr Arg Pro Ala Pro Gin His Ser Arg 
210 215 220 

Arg Ala Arg Arg Gly His Arg Tyr Arg Thr Asp Thr Glu Arg Asn Val 
225 230 235 240 

Gly Lys Val Ala Thr Gly Pro Ser lie Gin Ala Arg Leu Arg Ala Glu 
245 250 255 

Glu Ala Ser Gly Ala Gin Leu Ala Pro Gly Thr Glu Pro Ser Pro Ala 
260 265 : 270 
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Pro Leu Gly Gin Pro Arg Ser Tyr Leu Ala Pro Pro Thr Arg Pro Ala 
275 280 285 

Pro Thr Glu Pro Pro Pro Ser Pro Ser Pro Gin Arg Asn Ser Gly Arg 
290 295 300 

Arg Ala Glu Arg Arg Val His Pro Asp Leu Ala Ala Gin His Ala Ala 
305- 310 315 ^ 320 

Ala Gin Pro Asp Ser lie Thr Ala Ala Thr Thr Gly Gly Arg Arg Arg 
325 330 335 

Lys Arg Ala Ala Pro Asp Leu Asp Ala Thr Gin Lys Ser Leu Arg Pro 
340 345 350 

Ala Ala Lys Gly Pro Lys Val Lys Lys Val Lys Pro Gin Lys Pro Lys 
355 360 365 

Ala Thr Lys Pro Pro Lys Val Val Ser Gin Arg Gly Trp Arg His Trp 
370 375 380 

Val His Ala Leu Thr Arg lie Asn Leu Gly Leu Ser Pro Asp Glu Lys 
385 390 395 400 

Tyr Glu Leu Asp Leu His Ala Arg Val Arg Arg Asn Pro Arg Gly Ser 
405 410 415 

Tyr Gin lie Ala Val Val Gly Leu Lys Gly Gly Ala Gly Lys Thr Thr 
420 425 430 

Leu Thr Ala Ala Leu Gly Ser Thr Leu Ala Gin Val Arg Ala Asp Arg 
435 440 445 

lie Leu Ala Leu Asp Ala Asp Pro Gly Ala Gly Asn Leu Ala Asp Arg 
450 455 460 

Val Gly Arg Gin Ser Gly Ala Thr lie Ala Asp Val Leu Ala Glu Lys 
465 470 475 480 

Glu Leu Ser His Tyr Asn Asp lie Arg Ala His Thr Ser Val Asn Ala 
485 490 495 . 

Val Asn Leu Glu Val Leu Pro Ala Pro Glu Tyr Ser Ser Ala Gin Arg 
500 505 510 



Ala Leu Ser Asp Ala Asp Trp His Phe lie Ala Asp Pro Ala Ser Arg 
515 520 525 

Phe Tyr Asn Leu Val Leu Ala Asp Cys Gly Ala Gly Phe Phe Asp Pro 
530 535 540 

Leu Thr Arg Gly Val Leu Ser Thr Val Ser Gly Val Val Val Val Ala 
545 550 555 560 

Ser Val Ser lie Asp Gly Ala Gin Gin Ala Ser Val Ala Leu Asp Trp 
565 570 575 

Leu Arg Asn Asn Gly Tyr Gin Asp Leu Ala Ser Arg Ala Cys Val Val 
580 585 590 



lie Asn His lie Met Pro Gly Glu Pro Asn Val Ala Val Lys Asp Leu 



595 

Val Arg His Phe Glu 
610 

Pro Trp Asp Arg His 
625' 

Leu Asp Pro lie Tyr 
645 

Ser Asp Asp Phe Glu 
660 
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600 

Gin Gin Val Gin Pro Gly 
615 

lie Ala Ala* Gly Thr Glu 
630 635 

Lys Arg Lys Val Leu Glu 
650 

Arg Ala Gly Arg Arg 
665 




605 

Arg Val Val Val Met 
620 

lie Ser Leu Asp Leu 
640 

Leu Ala Ala Ala Leu 
655 



<210> 67 

<211> 511 

<212> PRT 

<213> mycobacterium tuberculosis 
<220> 

<22 3> Protein sequence Rv3877 

<400> 67 

Met Ser Ala Pro Ala Val Ala Ala Gly Pro Thr Ala Ala Gly Ala Thr 
1 5 10 15 

Ala Ala- Arg Pro Ala Thr Thr Arg Val . Thr lie Leu Thr Gly Arg Arg 
20 25 30 

Met Thr Asp Leu Val Leu Pro Ala Ala Val Pro Met Glu Thr Tyr lie 
35 40 45 

Asp Asp Thr Val Ala Val Leu Ser Glu Val Leu Glu Asp Thr Pro Ala 
50 55 60 

Asp Val Leu Gly Gly Phe Asp Phe Thr Ala Gin Gly Val Trp Ala Phe 
65 70 75 80 

Ala Arg Pro Gly Ser Pro Pro Leu Lys Leu Asp Gin Ser Leu Asp Asp 
85 90 95 

Ala Gly Val Val Asp Gly Ser Leu Leu Thr Leu Val Ser Val Ser Arg 
100 105 110 



Thr Glu Arg Tyr Arg Pro Leu Val Glu Asp Val lie Asp Ala lie Ala 
115 120 125 

Val Leu Asp Glu Ser Pro Glu Phe Asp Arg Thr Ala Leu Asn - Arg Phe 
130 135 140 

Val Gly Ala Ala lie Pro Leu Leu Thr Ala Pro Val lie Gly Met Ala 

145 150 155 160 

Met Arg Ala Trp Trp Glu Thr Gly Arg Ser Leu Trp Trp Pro Leu Ala 
165 170 175 



lie Gly lie Leu Gly lie Ala Val Leu Val Gly Ser Phe Val Ala Asn 
180 185 190 
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Arg Phe Tyr Gin Ser Gly His Leu Ala Glu Cys Leu Leu Val Thr Thr 
195 200 205 

Tyr Leu Leu lie Ala Thr Ala Ala Ala Leu Ala Val Pro Leu Pro Arg 
210 215 220 

Gly Val Asn Ser Leu Gly Ala Pro Gin Val Ala Gly Ala Ala Thr Ala 
225 230 235 240 

Val Leu Phe Leu Thr Leu Met Thr Arg Gly Gly Pro Arg Lys Arg His 
245 250 255 

Glu Leu Ala Ser Phe Ala Val lie Thr Ala lie Ala Val lie Ala Ala 
260 265 270 

Ala Ala Ala Phe Gly Tyr Gly Tyr Gin Asp Trp Val Pro Ala Gly Gly 
275 280 285 

lie Ala Phe Gly Leu Phe lie Val Thr Asn Ala Ala Lys Leu Thr Val 
290 295 300 

Ala Val Ala Arg lie Ala Leu Pro Pro lie Pro Val Pro Gly Glu Thr 
305 310 315 320 

Val Asp Asn Glu Glu Leu Leu Asp Pro Val Ala Thr Pro Glu Ala Thr 
325 330 -335 

Ser Glu Glu Thr Pro Thr Trp Gin Ala lie lie Ala Ser Val Pro Ala 
340 345 350 

Ser Ala Val Arg Leu Thr Glu Arg Ser Lys Leu Ala Lys Gin Leu Leu 
355 360 365 

lie Gly Tyr Val Thr Ser Gly Thr Leu lie Leu Ala Ala Gly Ala lie 
370 375 . 380 

Ala Val Val Val Arg Gly His Phe Phe Val His Ser Leu Val Val Ala 
385 390 395 400 

Gly Leu lie Thr Thr Val Cys Gly Phe Arg Ser Arg Leu Tyr Ala Glu 
405 410 415 

Arg Trp Cys Ala Trp Ala Leu Leu Ala Ala Thr Val Ala lie Pro Thr 
420 425 430 



Gly Leu Thr Ala Lys Leu lie He Trp Tyr Pro His Tyr Ala Trp Leu 
435 440 445 

Leu Leu Ser Val Tyr Leu Thr Val Ala Leu Val Ala Leu Val Val Val 
450 455 460 

Gly Ser Met Ala His Val Arg Arg Val Ser Pro Val Val Lys Arg Thr 
465 470 475 480 

Leu Glu Leu He Asp Gly Ala Met He Ala Ala He He Pro Met Leu 
485 490 495 



Leu Trp He Thr Gly Val Tyr Asp Thr Val Arg Asn He Arg Phe 
500 505 510 




<2X0> 68 
<211> 280 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3878 - conserved hypothetical alanine rich 
protein 

<400> 68 - 

Met Ala Glu Pro Leu Ala Val Asp Pro Thr Gly Leu Ser Ala Ala Ala 
15 10 15 



Ala Lys Leu Ala Gly Leu Val Phe 
20 

Val Ser Gly Thr Asp Ser Val Val 
35 40 



Pro Gin Pro Pro Ala Pro lie Ala 
25 30 

Ala Ala lie Asn Glu Thr Met Pro 
45 



Ser lie Glu Ser Leu Val Ser Asp Gly Leu Pro Gly Val Lys Ala Ala 
50 55 60 

J- 

Leu Thr Arg Thr Ala Ser Asn Met Asn Ala Ala Ala Asp Val Tyr Ala 
65 70 75 80 



Lys Thr Asp Gin Ser Leu Gly Thr Ser Leu Ser Gin Tyr Ala Phe Gly 
85 90 95 

Ser Ser Gly Glu Gly Leu Ala Gly Val Ala Ser Val Gly Gly Gin Pro 
100 105 110 

Ser Gin Ala Thr Gin Leu Leu Ser Thr Pro Val Ser Gin Val Thr Thr 
115 120 125 

Gin Leu Gly Glu Thr Ala Ala Glu Leu Ala Pro Arg Val Val Ala Thr 
130 135 140 

Val Pro Gin Leu Val Gin Leu Ala Pro His Ala Val Gin Met Ser Gin 
145 150 155 160 

Asn Ala Ser Pro lie Ala Gin Thr lie Ser Gin Thr Ala Gin Gin Ala 
165 170 175 

Ala Gin Ser Ala Gin Gly Gly Ser Gly Pro Met Pro Ala Gin Leu Ala 
180 \ 155 Lai) 

Ser Ala Glu Lys Pro Ala Thr Glu Gin Ala Glu Pro Val His Glu Val 
195 200 205 

Thr Asn Asp Asp Gin Gly Asp Gin Gly Asp Val Gin Pro Ala Glu Val 
210 215 220 



Val Ala Ala Ala Arg Asp Glu Gly Ala Gly Ala Ser Pro Gly Gin Gin 

225 230 235 240 

Pro Gly Gly Gly Val Pro Ala Gin Ala Met Asp Thr Gly Ala Gly Ala 

245 250 255 



Arg Pro Ala Ala Ser Pro Leu Ala Ala Pro Val Asp Pro Ser Thr Pro 
•■ 260 265 270 



55 



Ala Pro Ser Thr Thr Thr Thr Leu 
275 280 



<210> 69 

<211> 729 

<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3879c - hypothetical alanine and proline rich 
protein 

<400> 69 

Met Ser lie Thr Arg Pro Thr Gly Ser Tyr Ala Arg Gin Met Leu Asp 
15 10 15 

Pro Gly Gly Trp Val Glu Ala Asp Glu Asp Thr Phe Tyr Asp Arg Ala 
20 25 30 

Gin Glu Tyr Ser Gin Val Leu Gin Arg Val Thr Asp Val Leu Asp Thr 
35 40 45 

Cys Arg Gin Gin Lys Gly His Val Phe Glu Gly Gly Leu Trp Ser Gly 
50 55 60 

Gly Ala Ala Asn Ala Ala Asn Gly Ala Leu Gly Ala Asn lie Asn Gin 
65 70 75 80 

Leu Met Thr Leu Gin Asp Tyr Leu Ala Thr Val He Thr Trp His Arg 
85 90 95 

His He Ala Gly Leu He Glu Gin Ala Lys Ser Asp lie Gly Asn Asn 
100 105 110 

Val Asp Gly Ala Gin Arg Glu He Asp He Leu Glu Asn Asp Pro Ser 
115 120 125 

Leu Asp Ala Asp Glu Arg His Thr Ala He Asn Ser Leu Val Thr Ala 
130 135 140 

Thr His Gly Ala Asn Val Ser Leu Val Ala Glu Thr Ala Glu Arg Val 
A45 150 155 160. 

Leu Glu Ser Lys Asn Trp Lys Pro Pro Lys Asn Ala T.ru Gin AF;p T.ftu 



165 170 175 

Leu Gin Gin Lys Ser Pro Pro Pro Pro Asp Val Pro Thr Leu Val Val 
180 ' 185 190 

Pro Ser Pro Gly Thr Pro Gly Thr Pro Gly Thr Pro He Thr Pro Gly 
195 200 205 

Thr Pro He Thr Pro Gly Thr Pro He Thr Pro He Pro Gly Ala Pro 
210 215 220 

Val Thr Pro He Thr Pro Thr Pro Gly Thr Pro Val Thr Pro Val Thr 
225 .230 235 240 



Pro Gly Lys Pro Val Thr Pro Val Thr Pro Val Lys Pro G ly Thr Pt-o 
245 250 255 
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Gly Glu Pro Thr Pro lie Thr Pro Val Thr Pro Pro Val Ala Pro Ala 
260 265 -270 

Thr Pro Ala Thr Pro Ala Thr Pro Val Thr Pro Ala Pro Ala Pro His 
275 . 280 285 

Pro Gin .Pro Ala Pro Ala Pro Ala Pro Ser Pro Gly Pro Gin Pro Val 
290 295 300 

Thr Pro Ala Thr Pro Gly Pro Ser Gly Pro Ala Thr Pro Gly Thr Pro 
305 310 315 320 

Gly Gly Glu Pro Ala Pro His Val Lys Pro Ala Ala Leu Ala Glu Gin 
325 330 335 

Pro Gly Val Pro Gly Gin His Ala Gly Gly Gly Thr Gin Ser Gly Pro 
340 345 350 

Ala His Ala Asp Glu Ser Ala Ala Ser Val Thr Pro Ala Ala Ala Ser 
355 360 365 

Gly Val Pro Gly Ala Arg Ala Ala Ala Ala Ala Pro Ser Gly Thr Ala 
370 375 380 

Val Gly Ala Gly Ala Arg Ser Ser Val Gly Thr Ala Ala Ala Ser Gly 
385 390 395 400 

Ala Gly Ser His Ala Ala Thr Gly Arg Ala Pro Val Ala Thr Ser Asp 
405 410 415 

Lys Ala Ala Ala Pro Ser Thr Arg Ala Ala Ser Ala Arg Thr Ala Pro 
420 425 430 

Pro Ala Arg Pro Pro Ser Thr Asp His lie Asp Lys Pro Asp Arg Ser 
435 440 445 

Glu Ser Ala Asp Asp Gly Thr Pro Val Ser Met lie Pro Val Ser Ala 
450 455 460 

Ala Arg Ala Ala Arg Asp Ala Ala Thr Ala Ala Ala Ser Ala Arg Gin 
465 470 475 480 

Arg Gly Arg Gly Asp Ala Leu Arg Leu Ala Arg Arg lie Ala Ala Ala 
45^ 4 95 



Leu Asn Ala Ser Asp Asn Asn Ala Gly Asp Tyr Gly Phe Phe Trp lie 

500 505 510 

Thr Ala Val Thr Thr Asp Gly Ser lie Val Val Ala Asn Ser Tyr Gly 

515 520 525 

Leu Ala Tyr lie Pro Asp Gly Met Glu Leu Pro Asn Lys Val Tyr Leu 

530 535 540 

Ala Ser Ala Asp His Ala lie Pro Val Asp Glu lie Ala Arg Cys Ala 

545 550 ' 555 560 



Thr Tyr Pro Val Leu Ala Val Gin Ala Trp Ala Ala Phe His Asp Met 
565 570 575 



Thr Leu Arg Ala 
580 

. Pro Gly Val Ala 
595 

Gly Lys Met Thr 
610 

Ala Ala Gin Leu 
625 

Pro Pro Ala Pro 



Leu Trp Phe Glu 
660 

Glu Ala Ala His 
675 

Glu lie Ala Leu 
690 




Val He Gly Thr 



Lys He Val Leu 
600 

Gly Arg Ser Arg 
615 

Ala Asp Thr Thr 
630 

Val Asp Val Asn 
645 

Leu Met Lys Pro 



Leu Arg Ala Phe 
680 

His Gin Ala His 
695 
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Ala Glu Gin Leu 

585 

Glu Pro Asp Asp 



Leu Glu Val Val 
620 

Asp Gin Arg Leu 
635 

Pro Pro Gly Asp 
650 

Met Thr Ser Thr 
665 

Arg Ala Tyr Ala 



Thr Ala Thr Asp 
700 




Ala Ser Ser Asp 
590 

He Pro Glu Ser 
605 

Asp Pro Ser Ala 



Leu Asp Leu Leu 
640 

Glu Arg His Met 
655 

Ala Thr Gly Arg 
670 

Ala His Ser Gin 
685 

Ala Ala Val Gin 



Arg Val Ala Val Ala Asp Trp Leu Tyr Trp Gin Tyr Val Thr Gly Leu 
705 710 715 720 

Leu Asp Arg Ala Leu Ala Ala Ala Cys 
725 



<210> 70 
<211> 115 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3880c - conserved hypothetical protein 

<400> 70 

Val Ser Met Asp Glu Leu Asp Pro His Val Ala Arg Ala Leu Thr Leu 
15 10 15 

Ala Ala Arg Phe Gin Ser Ala Leu Asp Gly Thr Leu Asn Gin Met Asn 

20 25 3i3 



Asn Gly Ser Phe Arg Ala Thr Asp Glu Ala Glu Thr Val Glu Val Thr 
35 40 45 

He Asn Gly His Gin Trp Leu Thr Gly Leu Arg He Glu Asp Gly Leu 
50 55 60 

Leu Lys Lys Leu Gly Ala Glu Ala Val Ala. Gin Arg Val Asn Glu Ala 
' 65 70 75 80 

Leu His Asn Ala Gin Ala Ala Ala Ser Ala Tyr Asn Asp Ala Ala Gly 
85 90 95 



Glu Gin Leu Thr Ala Ala Leu Ser Ala Met Ser Arg Ala Met Asn Glu 
100 105 llfl 



58 



Gly Met Ala 
115 



<210> 71 

<211> 460 

<212> PRT . - 

<213> Mycobacterium tuberculosis 

<220> 

<223> Rv3 8 81c - conserved hypothetical alanine and 
glycine rich protein 

<400> 71 

Met Thr Gin Ser Gin Thr Val Thr Val Asp Gin Gin Glu lie Leu Asn 
15 10 15 

Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val 
20 25 30 

' Pro lie Thr Pro Cys Glu Leu Thr Ala Ala Lys Asn Ala Ala Gin Gin 
35 .40 45 

' Leu Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala 
.50 55 60 

Lys Glu Arg Gin Arg Leu Ala Thr Ser Leu Arg Asn Ala Ala Lys Ala 
65 70 75 80 

Tyr Gly Glu Val Asp Glu Glu Ala Ala* Thr Ala Leu Asp Asn Asp Gly 
85 90 95 

Glu. Gly Thr Val Gin Ala Glu Ser Ala Gly Ala Val Gly Gly Asp Ser 
100 105 110 

Ser Ala Glu Leu Thr Asp Thr Pro Arg Val Ala Thr Ala Gly Glu Pro 
115 120 125 

Asn Phe Met Asp Leu Lys Glu Ala Ala Arg Lys Leu Glu Thr Gly Asp 
13.0 ... 135 14.0 

Gin Gly Ala Ser Leu Ala His Phe Ala Asp Gly Trp Asn Thr Phe Asn 
145 150 155 160 

Leu Thr Leu Gin Gly Asp Val Lys Arg Phe Arg Gly Phe Asp Asn Trp 
165 170 175 

Glu Gly Asp Ala Ala Thr Ala Cys Glu Ala Ser Leu Asp Gin Gin Arg 
180 185 190 

Gin Trp lie Leu His Met Ala Lys Leu Ser Ala Ala Met Ala Lys Gin 
195 200 205 

Ala Gin Tyr Val Ala Gin Leu His Val Trp Ala Arg Arg Glu His Pro 
210 215 220 

Thr Tyr Glu Asp lie Val Gly Leu Glu Arg Leu Tyr Ala Glu Asn Pro 
225 230 235 240 

Ser Ala Arg Asp Gin lie Leu Pro Val Tyr Ala Glu Tyr Gin Gin Arg 
245 250 255 



59 



Ser Glu Lys Val Leu Thr Glu Tyr Asn Asn Lys Ala Ala Leu Glu Pro 
260 265 270 

Val Asn Pro Pro Lys Pro Pro Pro Ala lie Lys lie Asp Pro Pro Pro 
275 \ 280 285 

Pro Pro Gin Glu Gin Gly Leu lie Pro Gly Phe Leu Met Pro Pro Ser 
290 295 300 

Asp Gly Ser Gly Val Thr Pro Gly Thr Gly Met Pro Ala Ala Pro Met 

305 310 315 320 

Val Pro Pro Thr Gly Ser Pro Gly Gly .Gly Leu Pro Ala Asp Thr Ala 
325 330 335 

Ala Gin Leu Thr Ser Ala Gly Arg Glu Ala Ala Ala Leu Ser Gly Asp 
340 345 350 

Val Ala Val Lys Ala Ala Ser Leu Gly Gly Gly Gly Gly Gly Gly Val 
355 360 365 

Pro Ser Ala Pro Leu Gly Ser Ala lie Gly Gly Ala Glu Ser Val Arg 
370 375 380 

Pro Ala Gly Ala Gly Asp lie Ala Gly Leu Gly Gin Gly Arg Ala Gly 
385 390 395 400 

Gly Gly Ala Ala Leu Gly Gly Gly Gly Met Gly Met Pro Met Gly Ala 
405 410 415 

Ala His Gin Gly Gin Gly Gly Ala Lys Ser Lys Gly Ser Gin Gin Glu 
420 425 430 

Asp Glu Ala Leu Tyr Thr Glu Asp Arg Ala Trp Thr Glu Ala Val lie 
435 440 445 

Gly Asn Arg Arg Arg Gin Asp Ser Lys Glu Ser Lys 
450 455 460 



<210> 72 
<211> 462 
<212> PRT 

<213> Mycobacterium tuberculosis 



<220> 

<223> Rv3882c - possible conserved membrane protein 

<400> 72 

Met Arg Asn Pro Leu Gly Leu Arg Phe Ser Thr Gly His Ala Leu Leu 
15 10 15 

Ala Ser Ala Leu Ala Pro Pro Cys lie lie Ala Phe Leu Glu Thr Arg 
20 25 30 

Tyr Trp Trp Ala Gly lie Ala Leu Ala Ser Leu Gly Val lie Val Ala 
35 • 40 . 45 



Thr Val Thr Phe Tyr Glv Arg Arg lie Thr Gly Trp Val Ala Ala Val 
50 55 60 



60 



Tyr Ala Trp Leu Arg Arg Arg Arg Arg Pro Pro Asp Ser Ser Ser Glu 
65 70 75 80 

Pro Val Val Gly Ala Thr Val Lys Pro Gly Asp His Val Ala Val Arg 
.85 90 .95 

Trp Gin Gly Glu Phe Leu Val Ala Val lie Glu Leu lie Pro Arg Pro 
100 105 110 

Phe Thr Pro Thr Val lie Val Asp Gly Gin Ala His Thr Asp Asp Met 
115 120 125 

Leu Asp Thr Gly Leu Val Glu Glu Leu Leu Ser Val His Cys Pro Asp 
130 135 140 

Leu Glu Ala Asp lie Val Ser Ala Gly Tyr Arg Val Gly Asn Thr Ala 
145 150 155 160 

Ala Pro Asp Val Val Ser Leu Tyr Gin Gin Val lie Gly Thr Asp Pro 
165 170 175 

Ala Pro Ala Asn Arg Arg Thr Trp lie Val Leu Arg Ala Asp Pro Glu 
180 185 190 

Arg Thr Arg Lys Ser Ala Gin Arg Arg Asp Glu Gly Val Ala Gly Leu 
195 200 205 

Ala Arg Tyr Leu Val Ala Ser Ala Thr Arg lie Ala Asp Arg Leu Ala 
210 215 220 

Ser His Gly Val Asp Ala Val Cys Gly Arg Ser Phe Asp Asp Tyr Asp 
225 230 235 240 

His Ala Thr Asp lie Gly Phe Val Arg Glu Lys Trp Ser Met lie Lys 
245 250 255 

Gly Arg Asp Ala Tyr Thr Ala Ala Tyr Ala Ala Pro Gly Gly Pro Asp 
260 265 270 

Val Trp Trp Ser Ala Arg Ala Asp His Thr lie Thr Arg Val Arg Val 
275 280 285 

Ala Pro Gly Met Ala Pro Gin Ser Thr Val Leu Leu Thr Thr Ala Asp 
2SO 2SS 3XX0 



Lys Pro Lys Thr Pro Arg Gly Phe Ala Arg Leu Phe Gly Gly Gin Arg 
305 310 315 320 



Pro Ala Leu Gin Gly Gin His Leu Val Ala Asn Arg His Cys Gin Leu 
325 330 335 

Pro He Gly Ser Ala Gly Val Leu Val Gly Glu Thr Val Asn Arg Cys 
340 345 350 

Pro Val Tyr Met Pro Phe Asp Asp Val Asp He Ala Leu Asn Leu Gly 
355 360 365 

Asp Ala Gin Thr Phe Thr Gin Phe Val Val Arg Ala Ala Ala Ala Gly 
^7n \ 3JZ5 iai) 
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Ala Met Val Thr Val Gly Pro Gin Phe Glu Glu Phe Ala Arg Leu lie 
385 390 395 400 

Gly Ala His lie Gly Gin Glu Val Lys Val Ala Trp Pro Asn Ala Thr 
405 410 415 

Thr Tyr Leii Gly Pro His Pro Gly lie Asp Arg Val lie Leu Arg His 
420 425 430 

Asn Val lie Gly Thr Pro Arg His Arg Gin Leu Pro lie Arg Arg Val 
435 440 445 

Ser Pro Pro Glu Glu Ser Arg Tyr Gin Met Ala Leu Pro Lys 
450 455 460 



<210> 73 
<211> 446 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3883c - possible secreted protease 
<400> 73 

Val His Arg lie Phe Leu' lie Thr Val Ala Leu Ala Leu Leu Thr Ala 
1 5 10 15 

Ser Pro Ala Ser Ala He Thr Pro Pro Pro He Asp Pro Gly Ala Leu 
20 25 30 

Pro Pro Asp Val Thr Gly Pro Asp Gin Pro Thr Glu Gin Arg Val Leu 
35 . 40 45 

Cys Ala Ser Pro Thr Thr Leu Pro Gly Ser Gly Phe His Asp Pro Pro 
50 55 60 

Trp Ser Asn Thr Tyr Leu Gly . Val Ala Asp Ala His Lys Phe Ala Thr 
65 70 75 80 

Gly Ala Gly Val Thr Val Ala Val He Asp Thr Gly Val Asp Ala Ser 
85 90 "" 95 

Pro Arg Val Pro Ala Glu Pro Gly Gly Asp Phe Val Asp Gin Ala Gly 
IDXi [ XD3 Hfl 



Asn Gly Leu Ser Asp Cys Asp Ala His Gly Thr Leu Thr Ala Ser He 
115 120 125 

He Ala Gly Arg Pro Ala Pro Thr Asp Gly Phe Val Gly Val Ala Pro 
130 135 140 

Asp Ala Arg Leu Leu Ser Leu Arg Gin Thr Ser Glu Ala Phe Glu Pro 
145 150 155 160 

Val Gly Ser Gin Ala Asn Pro Asn Asp Pro Asn Ala Thr Pro Ala Ala 
165 170 .175 



Gly Ser He Arg Ser Leu Ala Arg Ala Val Val His Ala Ala Asn Leu 

180 185 ] : lan 
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Gly Val Gly Val lie Asn lie Ser Glu Ala Ala Cys Tyr Lys Val Ser 
195 200 205 

Arg Pro lie Asp Glu Thr Ser Leu Gly Ala Ser lie Asp Tyr Ala Val 
210 215 220 

Asn Val Lys Gly Val Val Val Val Val Ala Ala Gly Asn Thr Gly Gly 
225 . 230 235 240 

Asp Cys Val Gin Asn Pro Ala Pro Asp Pro Ser Thr Pro Gly Asp Pro 
245 250 255 

Arg Gly Trp Asn Asn Val Gin Thr Val Val Thr Pro Ala Trp Tyr Ala 
260 265 270 

Pro Leu Val Leu Ser Val Gly Gly lie Gly Gin Thr Gly Met Pro Ser 

275 280 285 

Ser Phe Ser Met His Gly Pro Trp Val Asp Val Ala Ala Pro Ala Glu 
290 295 300 

Asn lie Val Ala Leu Gly Asp Thr Gly Glu Pro Val Asn Ala Leu Gin 
305 310 315 320 

Gly Arg Glu Gly Pro Val Pro lie Ala Gly Thr Ser Phe Ala Ala Ala 
325 330 335 

Tyr Val Ser Gly Leu Ala Ala Leu Leu Arg Gin Arg Phe Pro Asp Leu 
340 345 350 

Thr Pro Ala Gin lie lie His Arg He Thr Ala Thr Ala Arg His Pro 
355 360 365 

Gly Gly Gly Val Asp Asp Leu Val Gly Ala Gly Val He Asp Ala Val 
370 375 380 

Ala Ala Leu Thr Trp Asp He Pro Pro Gly Pro Ala Ser Ala Pro Tyr 
385 390 395 400 

Asn Val Arg Arg Leu Pro Pro Pro Val Val Glu Pro Gly Pro Asp Arg 
405 410 415 

Arg Pro He Thr Ala Val Ala Leu Val Ala Val Gly Leu Thr Leu Ala 
420 425 430 



Leu Gly Leu Gly Ala Leu Ala Arg Arg Ala Leu Ser Arg Arg 
435 440 445 



<210> 74 
<211> 619 
<212> PRT 

<213> Mycobacterium tuberculosis 

<220> 

<223> Rv3884c - probable CBXX/CFQX family protein 



<400> 74 

Met Ser Arg Met Val Asp Thr Met Gly Asp Leu Leu Thr Ala Arg Arg 

5 10 : 15 




Hi's 


Phe 


Asp 


Arg 
20 


Ala 


Met 


Thr 


He 


Lys 
25 


Asn 


Gly Gin Gly 


Cys 
30 


Val 


Ala 


Ala 


Leu 


Pro 
35 


Glu 


Phe 


Val 


Ala 


Ala 
40 


Thr 


Glu 


Ala 


Asp 


Pro 
45 


Ser 


Met 


Ala 


Asp 


Ala 
50 


Trp 


Leu 


Gly. Arg 


He. 
55 


Ala 


Cys 


Gly 


Asp 


-Arg 
60 


Asp 


Leu 


Ala 


Ser 


Leu 
65 


Lys 


Gin 


Leu 


Asn 


Ala 
70 


His 


Ser 


Glu 


Trp 


Leu 
75 


His 


Arg 


Glu 


Thr 


Thr 
80 


Arg 


He 


Gly Arg 


Thr 
85 


Leu 


Ala 


Ala 


Glu 


Val 
90 


Gin 


Leu Gly 


Pro 


Ser 
95 


He 


Gly 


He 


Thr 


Val 
100 


Thr 


Asp 


Ala 


Ser 


Gin 
105 


Val 


Gly 


Leu 


Ala 


Leu 
110 


Ser 


Ser 


Ala 


Leu 


Thr 
115 


He 


Ala 


Gly 


Glu 


Tyr 
120 


Ala 


Lys 


Ala 


Asp 


Ala 
125 


Leu 


Leu 


Ala 


Asn 


Arg 
130 


Glu 


Leu 


Leu 


Asp 


Ser 
135 


Trp 


Arg 


Asn 


Tyr 


Gin 
14 0 


Trp 


His 


Gin 


Leu 


Ala 
145 


Arg 


Ala 


Phe 


Leu 


Met 
150 


Tyr 


Val 


Thr 


Gin 


Arg 
155 


Trp 


Pro 


Asp 


Val 


Leu 
160 


Ser 


Thr 


Ala 


Ala 


Glu 
165 


Asp 


Leu 


Pro 


Pro 


Gin 
170 


Ala 


He 


Val 


Met 


Pro 
175 


Ala 


Val 


Thr 


Ala 


Ser 
180 


He 


Cys 


Ala 


Leu 


Ala 
185 


Ala 


His 


Ala 


Ala 


Ala 
190 


His 


Leu 


Gly 


Gin 


Gly Arg 
195 


Val 


Ala 


Leu 


Asp 
200 


Trp 


Leu 


Asp 


Arg 


Val 
205 


Asp 


Val 


He 


Gly His 
210 


Ser 


Arg 


Ser 


Ser 


Glu 
215 


Arg 


Phe 


Gly 


Ala 


Asp 
220 


Val 


Leu 


Thr 


Ala 


Ala 
225 


He 


Gly 


Pro 


Ala 


Asp 
230 


He 


Pro 


Leu 


Leu 


Val 
235 


Ala 


Asp 


Leu 


Ala 


Tyr 
240 


Val 


Arg 


Gly Met 


Val 
245 


Tyr 


Arg 


Gin 


Leu 


His 
250 


Glu 


Glu 


Asp 


Lys 


Ala 
255 


Gin 


He 


Trp 


Leu 


Ser 
260 


Lys 


Ala 


Thr 


He 


Asn 
265 


Gly 


Val 


Leu 


Thr 


Asp 
270 


Ala 


Ala 


Lys 


Glu 


Ala 
275 


Leu 


Ala 


Asp 


Pro 


Asn 
280 


Leu 


Arg 


Leu 


He 


Val 
285 


Thr 


Asp 


Glu 


Arg 


Thr 
290 


He 


Ala 


Ser 


Arg 


Ser 
295 


Asp 


Arg 


Trp 


Asp 


Ala 
300 


Ser 


Thr 


Ala 


Lys 


Ser 
305 


Arg 


Asp 


Gin 


Leu 


Asp 
310 


Asp 


Asp 


Asn 


Ala 


Ala 
315 


Gin 


Arg 


Arg 


Gly 


Glu 
320 


Leu 


Leu 


Ala 


Glu 


Gly 
325 


Arg 


Glu 


Leu 


Leu 


Ala 
330 


Lys 


Gin 


Val 


Gly 


.Leu 
335 


Ala 



Ala Val Lys Gin Ala Val Ser Ala Leu Glu Asp Gin Leu Glu Val Arg 



.0L ^ » 
f 
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340 345 350 

Met Met Arg Leu Glu His Gly Leu Pro Val Glu Gly Gin Thr Asn His 
355 360 365 

Met Leu Leu Val Gly Pro Pro Gly Thr Gly Lys Thr Thr Thr Ala Glu 

370, ■ 375 380 

Ala Leu Gly Lys lie Tyr Ala Gly Met Gly He Val Arg His Pro Glu 
385 390 395 400 

He Arg Glu Val Arg Arg Ser Asp Phe Cys Gly His Tyr He Gly Glu 
405 410 415 

Ser Gly Pro Lys Thr Asn Glu Leu He Glu Lys Ser Leu Gly Arg He 
420 425 430 

He Phe Met Asp Glu Phe Tyr Ser Leu He Glu Arg His Gin Asp Gly 
435 440 445 

Thr Pro Asp Met He Gly Met Glu Ala Val Asn Gin Leu Leu Val Gin 
450 455 460 

Leu Glu Thr His Arg Phe Asp Phe Cys Phe He Gly Ala Gly Tyr Glu 
465 470 475 480 

Asp Gin Val Asp Glu Phe Leu Thr Val Asn Pro Gly Leu Ala Gly Arg 
485 490 495 

Phe Asn Arg Lys Leu Arg Phe Glu Ser Tyr Ser Pro Val Glu He Val 
500 505 510 

Glu He Gly His Arg Tyr Ala Thr Pro Arg Ala Ser Gin Leu Asp Asp 
515 520 525 

Ala Ala Arg Glu. Val Phe Leu Asp Ala Val Thr Thr He Arg Asn Tyr 
530 535 54.0 

Thr Thr Pro Ser Gly Gin His Gly He Asp Ala Met Gin Asn Gly Arg 
545 550 555 560 

Phe Ala Arg Asn Val He Glu Arg Ala Glu Gly Phe Arg Asp Thr Arg 
565 570 575 

Val Val Ala Gin Lys Arc Ala Gly Gin Pro Val Ser Val Gin Asp Leu 
580 585 590 

Gin He He Thr Ala Thr Asp He Asp Ala Ala He Arg Ser Val Cys 
595 600 605 

Ser Asp Asn Arg Asp Met Ala Ala He Val Trp 
610 615 



<210> 75 

<211> 537 

<212> PRT 

<213> Mycobacterium tuberculosis 



<220> 

<223> Rv3885c - possible conserved membrane protein 




Lys Leu Thr Gly Phe Ser Pro Arg Ser Ala Arg Arg Val 
5 10 15 

Ala Gly Val Trp Thr Val Phe Val Leu Ala Ser Ala Gly Trp Ala Leu 

20 • - 25 ■ • 30 - 

Gly Gly Gin Leu Gly Ala Val Met Ala Val Val Val Gly Val Ala Leu 
35 40 45 



<400> 75 
Leu Thr Ser 
1 



Val Phe Val Gin Trp Trp Gly Gin Pro Ala Trp Ser Trp Ala Val Leu 
50 55 60 

Gly Leu Arg Gly Arg Arg Pro Val Lys Trp Asn Asp Pro lie Thr Leu 
65 70 75 80 

Ala Asn Asn Arg Ser Gly Gly Gly Val Arg Val Gin Asp Gly Val Ala 
85 90 95 

Val Val Ala Val Gin Leu Leu Gly Arg Ala His Arg Ala Thr Thr Val 
100 105 110 



Thr Gly Ser Val Thr Val Glu Ser Asp Asn Val lie Asp Val Val Glu 

115 120 125 

Leu Ala Pro Leu Leu Arg His Pro Leu Asp Leu Glu Leu Asp Ser lie 

130 135 140 

Ser Val Val Thr Phe Gly Ser Arg Thr Gly Thr Val Gly Asp Tyr Pro 

145 150 155 160 



Arg Val Tyr Asp Ala Glu lie Gly Thr Pro Pro Tyr Ala Gly Arg Arg 

165 170 175 

Glu Thr Trp Leu lie Met Arg Leu Pro Val lie Gly Asn Thr Gin Ala 

180 185 190 

Leu Arg Trp Arg Thr Ser Val Gly Ala Ala Ala lie Ser Val Ala Gin 

195 200 205 

Arg Val Ala . Ser Ser Leu Arg Cys Gin Gly Leu Arg Ala Lys Leu Ala 

210 215 220 

Thr Ala Thr- Asp T.g^ii Ala fll n Lgu Afip Arg AT-g T.f>ii niy fi<=>-r Aap Ala 



225 230 235 240 

Val Ala Gly Ser Ala Gin Arg Trp Lys Ala lie Arg Gly Glu Ala Gly 
245 250 255 

Trp Met Thr Thr Tyr Ala Tyr Pro Ala Glu Ala lie Ser Ser Arg Val 

260 265 270 

Leu Ser Gin Ala Trp Thr Leu Arg Ala Asp Glu Val lie Gin Asn Val 

275 . - 280 285 

Thr Val Tyr Pro Asp Ala Thr Cys Thr Ala Thr lie Thr Val Arg Thr 

290 295 300 



Pro Thr Pro Ala Pro Thr Pro Pro Ser Val Tie Leu Arg Arg Leu Asn 
305 310 315 320 



Gly Glu Gin Ala Ala Ala Ala Ala Ala Asn Met Cys Gly Pro ^Arg Pro 
325 330 * 335 

His Leu Arg Gly Gin Arg Arg Cys Pro Leu Pro Ala Gin Leu Val Thr 
- -340 . 345 350 7 * 

Glu lie Gly Pro Ser Gly Val Leu lie Gly Lys Leu .Ser Asn Gly Asp 
355 360 "365^ 

Arg Leu Met lie Pro Val Thr Asp Ala Gly Glu Leu Ser Arg Val Phe 
370 375 380 

Val Ala Ala Asp Asp Thr lie Ala Lys Arg lie Val lie Arg Val Val 
385 390 395 400 

Gly Ala Gly Glu Arg Val Cys Val His Thr Arg Asp Gin Glu Arg Trp 
405 410 415 

Ala Ser Val Arg Met Pro Gin Leu Ser lie Val Gly Thr Pro Arg Pro 
420 425 430 

Ala Pro Arg Thr Thr Val Gly Val Val Glu Tyr Val Arg Arg Arg Lys 
435 440 445 

Asn Gly Asp Asp Gly Lys Ser Glu Gly Ser Gly Val Asp Val Ala lie 
450 455 460 

Ser Pro Thr Pro Arg Pro Ala Ser Val lie Thr lie Ala Arg Pro Gly 
465 470 475 480 

Thr Ser Leu Ser Glu Ser Asp Arg His Gly Phe Glu Val Thr lie Glu 
485 490 495 

Gin lie Asp Arg Ala Thr Val Lys Val Gly Ala Ala Gly Gin Asn Trp 
500 505 510 

Leu Val Glu Met Glu Met Phe Arg Ala Glu Asn Arg Tyr Val Ser Leu 
515 520 525 



Glu Pro Val Thr Met Ser lie Gly Arg 
530 535 



